FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACACTCCACCAT ATAGACC CGGCT T TACC T TATATCAGTGACAC T GG 
TACAGTAGCTCCAGAAZ^AATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C T AC CAT T TATGT TC GT TATAAGCAAGTT CAT GCTC T GAGTC C TGAAGAGAACGT TAT CATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGT T T T T T CCT GAC T TACATT CGTGAT T T T CAGAAAATT TC TT TACGGGTGGAAGCCAAT TT ACA 
T GGAT TAACCC TC TATGACAC TGCACC TT GCCC TAT TAACAATGAAC GAACACGGCT ACTT T CCA 
GAGAT AT T TGATGAAAGGATAAAATAT T T CT GTAAT GAT TAT GATTCT CAGGGAT TGGGGAAAGG 
T TCACAGAAGT TGC T TAT T CT T C TCT GAAATT T T CAACCACT TAATCAAGGC TGACAGT AACACT 
GATGAAT GCT GATAAT CAGGAAACATGAAAGAAGCCAT T T GATAGAT TAT TC TAAAGGATATCAT 
CAAGAAGACT AT TAAAAACACC T ATGCC TATAC T T T T T TAT CTC AGAAAAT AAAG TCAAA&GAC T 
ATG 



FIGURE 2 

<subunit 1 of 1, 2 66 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGWLMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

f*\ Important features : 

III Type II transmembrane domain: 

W amino acids 13-33 

111 

if? 

"f* Other Transmembrane domains: 

^ amino acids 54-73, 94-113, 160-180, 122-141 

Ml N-myristoylation sites. 

\\ amino acids 57-63, 95-101, 99-105, 124-130, 183-189 

S 

m 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACT T CGCC ATGA GT T TCCTCATCGACT CCAGCAT CATGATTACC TCCCAGATACTATTTT TT G 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CAT T CTCAGCCCAAAACAT GGGATCTTAT CCATAGAACAGCTCAT CAGCCGGGTT GGTGTGATT GGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATAT GCTACCAAGGAGAGAATAGAATACT CCAAAACCTT CAAGGGGAAATATTTTAATTTT CTT GGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGC AT TGAGAT CACT GTGAATTAT CTGGGAATCCAAT TTGATGT 
GAAGT T T T GGT CC CAAC ACAT TT C C T T CAT TC T T GT T GGAATAAT CAT C GT CACAT C CAT CAG AGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCT TGAAC TT AAGC CTACTACAGACTGT TAGAGGCCAGTGGTTT CAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCC TGACAT T TTATAAACAAACAAAATGCT ATGGTAGC 
ATTT TTCACCTTCATAGCATACT CCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
T GTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTC TGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTT TTAAGGTTCACAT GGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



APP_ID=10063597 



Page 153 of 318 



FIGURE 4 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTimLLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHN^ 

LFLE TADLYATKERI E YSKT FKGKY FNFLGYFFS I YCVWKI FMAT IN I VFDRVGKTDPVTRGI E I 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLL IRMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSAL S S I LFLYLAHKQAPEKQMAP 



Important features: 
|f| Signal peptide: 

U$ amino acids 1-23 

m 

%Q 

% « Potential transmembrane domains: 

% " amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 

O 425-444 

m 

T% N-glycosylation sites. 

f\ amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 

AGCAGGGAAATCCGGATGTCTCGGTTAT GAAGTGGAGCAGT GAGTGTGAGCCTCAACATAGT TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCGT TGTGAG(ZAAAAAGGCGAACCAGCAGCTGAATT TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T TGCAGCC TAT TGT TACAACT CATCT GAT ACT T GGACT AACTCGT GCAT T CCAGAA 
ATTATCACCACCAAAGATCCCATAT TCAACACTCAAACTGCAACACAAACAAGAGAAT TTATTGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGC TCCAGC T TC CACT T CT AT TCCACGGAGAAAAAAAT T GAT T TGTGTCACAGAAGT T T T TATG 
GAAACTAGCACCAT GT CT ACAGAAAC T GAACCAT T T GT TGAAAATAAAGCAGCATT CAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGT^AT 
C AGCAGAAGGAAAT GAT CGAAACCAAAGT AGTAAAGGAGGAGAAGGCCAAT GATAGCAACCCTAA 
TGAGGAAT CAAAGAAAAC T GATAAAAACCCAGAAGAGTCCAAGAGT CCAAGCAAAAC TACCGT GC 
GATGCC T GGAAGC TGAAGT T TAGATGAGACAGAA&T GAG G AG AC AC AC CT GAGGC TGGT T T C T T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAAT7VATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGT CACT GAGACCAAGGCT T TC T CTACTGAT T CCGCAGCT CAGACCC T T TC T T CA 
GCTCTGAAAGAGAAPlCACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
ACAGACAGGGTCAAAGTGT TTTCTC TGAACACATTGAGT TGGAATCACTGT TTAGAACACACACA 
C TTACT T T TT C T GGTCT CT ACCACT GCT GATAT TT T C TC TAGGAAATATAC TT TTACAAGTAACA 
AAAATAAAAAC TC T TAT AAAT T TCTAT T T T T ATC T GAGT TACAGAAAT GAT TAC TAAGGAAGAT T 
AC TCAGTAATT TGT T TAAAAAGT AATAAAAT TCAACAAACATT TGCT GAAT AGC TAC TATATG TC 
AAGT GCT GT GCAAGGTAT TACACTCT GTAAT TGAATATTAT TCC TCAAAAAATT GCACATAG TAG 
AAC GC TATCTGGGAAGCTAT TT T T T T CAGT TT T GATAT T T C T AGC T TATC TACT TCCAAACT AAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATACC T AAGAAGTACAT TGT TACC TC TATATACCAAAGCACAT T T TAAAAGT GCC 
ATTAACAAAT GTAT CACTAGCCC TCCT T T T TCCAACAAGAAGGGACTGAGAGAT GCAGAAATAT T 
T G TGACAAAAAAT TAAAGCAT T TAGAAAACT T 



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
S SDTWTNSCI PE 1 1 TTKDPI FNTQTATQTTE FI VSDSTYSVAS PYST I PAPTTTPPAPASTS I PR 
RKKL I CVTEVFMET S TMS TE TEPFVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site* 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 7 9-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GC TCTCAGC TACCCGCAGGAGGAGGCCACCCT CAATGAGATGTTCC GCGAGGTT GAGGAAC TGAT 

GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 

AAGCAT CAT CAGAAGT GAACC TGGCAAACTTACCT CCCAGCTATCACAAT GAGACCAACACAGAC 

ACGAAGGT TGGAAATAATACCATCCAT GTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 

TGGACAAATGGTCTT TT CAGAGACAGT TATCACATCTGT GGGAGACGAAGAAGGCAGAAGGAGCC 

ACGAGT GCAT CAT CGAC GAGGACTGTGGGCCCAGCATGTACT GCCAGT T T GCCAGC T TCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACC T GGAGAGGAGCC TGAC TGAAGAGATGGCGC T GGGGGAGCC T GCGGCTG 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

T T TG TT CT ACAT GGC T T T GATAATT GTT T GAGGGGAGGAGATGGAAACAAT GT GGAGT CT CC CTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 

C T CCACTACCCCACACCAGCCT T GGT GCCACCAAAAGTGCT CCCCAAAAGGAAGGAGAATGGGAT 

T T T T C TT GAGGCAT GCACAT C TGGAAT TAAGGT CAAAC TAATT C T CACAT CCCT CTAAAAGTAAA. 

CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 

ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 

TACAGGTTAACC T GCAGAAACAGTAC T TAGG TAATT G TAGGGCGAGGAT TATAAAT GAAAT T T GC 

AAAATCAC T T AGCAGCAAC T GAAGACAAT T ATCAACCACGT GGAGAAAATCAAACCGAGCAGGGC 

TGTGTGAAACAT GGT T GT AATATGCGACT GCGAACAC T GAACT CTACGCCACTCCACAAAT GATG 

TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 

CATGATTGTATAAGCATGCTT TC TTTGAGTT TTAAAT TATGTATAAACATAAGT TGCAT TTAGAA 

ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEWLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGT ICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPT^SRLLDL I TWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Is- Signal sequence: 

~h amino acids 1-19 

ff! 

Uj N-glycosylation site. 

HI amino acids 96-100, 106-110, 121-125, 204-208 

SJ 

3 Casein kinase II phosphorylation site. 

13 amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

VI 327-331 

li. N~myristoylation site. 

fit amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
H CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
Q TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
Q GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
111 ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
^0 GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
iH AGAT TCT GGGCAAGAAGAACGAGGCGCC CCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC CGG 

Iff CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGG 
O CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
W TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
pj AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT TTAT TAAAGATATT TTGT TAACTC 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGYLGLFWTL 
NOTLALGQCVLAGAFASFYWAFHK^ 

EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE ll 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCMlGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGT CAGCCCTATCCAGT AT CCCT GAACAGAAAT GCAACCCCCATTT GCCAACCCAGCT 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 



FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLI LVGLTVGAFYI PDGS FTNI WFYFGWGS FLFI LIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNS GLLQASVI TLYTMFVTWSALS S I PEQKCNPHLPT QLGNET WAGPEGYE TQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TWTAVWVKI CASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGAC T CC TGTAACC CT CCT CCAGGATGAACC ACCT GCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCGATCAACCCCACACAA 
CTCATGGCCAGGAT TGAGTCC TATGAAGGAAGGGAAAAGAAAGGCATAT CT GATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT TGAGAACACAT T AGAGAAGGAGGT GATGCAGT AT GAC T AC TAT TCTT CAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACT GCGCCAT T GGTGGGCAATAGCGT TGACAACGGCAGT GAC CAGTGC CT T T T TACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT T TTAGAAC TATGAGTACTACT TT TGT T AAATGTGAAAAACCC T CACAGAAAGT C 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTT TAT TGAACAGCTAATAAAGAT TTAT TTAT TGTAATACCTCACAAACGT TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T CAGT GAGACTGAGC CTGAT GTGT TAACAAATAGGT GAAGAAAGTC T T GT GCTGTATTCCT AAT C 
AAAAGACTTAATATAT TGAAGTAACACTTT TTTAGTAAGCAAGATACCTT TT TATTTCAAT TCAC 
AGAAT GGAAT TTTTTTGTTT CAT GTCT CAGAT T TAT TT TGTAT T TCT T T T T TAACAC T CT ACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
AC AAAAT GAC T T AAAC CAT T C AT ATC AT GTTTCCTTT GCGTT CAGCC AAT T T CAAT TAAAATGAA 
CTAAATTAAAAA 



FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLW 1 1 ELNVNGGI ENT LEKEVMQYDYYS S YFDI FLLAVFRFKVL I LAYAVCRLRHWWAI ALTT 
AVTSAFLIAlKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 

CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 

GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 

CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 

CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 

AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 

TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 

CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 

CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCT'GGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GC TGGCTGCGGTGGGGGCAGCC TACTGT GT GCGGCGGGGGCGGGCCAT GGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGC TGT GT GAC TC TAGTCT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 

TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 

GGGAAGATGTTTT TCAAACTCAGAGACAAGGACT TTGGTTT TTGTAAGACAAACGATGATATGAA 

GGCCT T T T GTAAGAAAAAATAAAAGAT GAAGT GT GAAA 



FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGIAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWTOESHVTLASP 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

S PPS TAPPTVGPVPQPQDCPPS TCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAAIAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 

CEVPLMGFPGPGLQS PLHAKP Y I 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 57 9-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites . 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAG AAGG C AGC C AG GAT GAAT C C T TAG AT T C C AAGAC T AC T T T GACAT CAG AT GAG T C AG T A 
AAGGACCATAC TACTGCAGGCAGAGTAGTTGCTGGTCAAATAT TTCT TGAT TCAGAAGAATCTGA 
ATTAGAAT CCT CTAT TCAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATAT CAGC T TTCTAGAGT C T CCAAAT CCAGAAAACAAGGAC TAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
C TACAAC C TATGAC TACAAAGCAGAT GAAAAG TGGGGCT T T T GT GAAACT GAAGAAGAGGCT GC T 

u 

ffl AAGAGACGGCAGATGCAGGAAGCAGAAATGAT GT ATCAAAC T GGAAT GAAAATCC TTAATGGAAG 

|l| CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
y 1 CCAAAGCCCT GGAGAGAGT GT CATAT GCT CTT T TAT T TGGT GAT TAC T T GCCACAGAATATCCAG 

r;\ GCAGCGAGAGAGAT GT T T GAGAAGC TGAC T GAGGAAGGCT C TCCCAAGGGACAGACT GCT CT TGG 

, * CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
Q TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
lH AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 
J'f CAGAT T T GC T TGTAT TAAGACCAAATAT T CAGT TGAACT T CCT TCAAAT T CT TGT T AAT GGAT AT 

*2 AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAATTTTTCTTTAAAATGATTAG 

W 

fit TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAAT T T T G T AAAT GG T G GC T AT AGAAAAAC AAC AT GAAAT AT T AT ACAAT AT T T T GC AAC AAT GC 
C C TAAGAAT TGT T AAAAT TCATGGAGT TAT TTGT GCAGAAT GAC T CCAGAGAGC TCTACT T T CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCATTTTTAATAAAATTATGTCTAAGATTAAAAAAAAAAAAAAAAA?\AAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDSSLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
E S S I QEEEDSLKS QEGE SVTEDI S FLE S PNPENKDYEE PKKVRKPALTAIEGTAHGE PCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVY YT FGALGGNL I AHMVLVS RL 



Important features: 
Signal peptide: 

#n amino acids 1-21 

p| N-glycosylation sites. 



amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myri s toyl a tion si te s . 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 



Fructose-bisphosphate aldolase class- II protein. 

amino acids 101-119 
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FIGURE iq 



AAT T CAGAT T T TAAGCCCAT TC TGCAGT GGAAT TTCATGAACTAGCAAGAGGACACCATC T T CT T 
GTAT TATACAAGAAAGGAG TG TACCTAT CACACACAGGGGGAAAAATGC T C T TT TGGGTGCT AGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATTTCATGTAATGGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC T CAGAGAGAC T TCGT AC TGTGCT TC TG.GATGTGACCGAC CCAGAGAAT GT CAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
M C CCGGCGT GCT GGC TCCCACT GACT GGC T GACACTAGAGGACTACAGAGAACC TAT TGAAG T GAA 

jj| CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
:^ TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
y| TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
Iff ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
tfl TCGC CAT T T GGGAGCAGC TGT C T CCAGACATCAAACAACAAT AT GGAGAAGGT TACAT TGAAAAA 

N AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

at 

CAT GG AC C AC GC T C T AAC AAG TCTCTTCCC T AAG AC T CAT T AT GC C GC T G GAAAAGAT G C C AAAA 

M 

in T T T TC TGGATACC T C T GT CTCACATGCCAGCAGC T T T GCAAGAC T T T T TATT GT TGAAACAGAAA 

Q GCAGAGCTGGCTAATCCCAAGGCAGTG TGA CTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

5r: CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

FIJ 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTG7\ATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKMTSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLIMAGVPGVIAPTDW 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIA^ 

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites, 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACT GAT T CCCAAAT GGATGAT GT TGAAGT TGT T TATACAAT TGA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGT CAT T CAGATCAGATCAT GACGT T TAGAGAGAGGCTGCTT CACAAAAACT TGCAGGAGCAT T T 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
ATCGACTGGAACATTCCTTATATAAACCT CAAAAAGGACTTTT TCACAGGGTACCTT TAGTGGT T 
GCCAATCTGGGCATGTCTG-AACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TT TAGCCGAGCAGTACAAACACACAGC T C TAAAT T T T T T GAAGAAGATGGATCCT TAAAGGAGG 
TACATAAGAT AAAT GAAAT GTATGC T TCAT TACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT G T C T T T AAAAAAT AG AC AT G T T T C T AAAAG TAG C T G T AAC T AC AAC C AC CAT C T CG AT G TAG T 
AGACAATC TGACC T TAAT GGTAGAACACAC TGACAT TCCT GAAGCTAGTCCAGCTAGTACACCAC 
AAAT CAT T AAGCATAAAGCCT TAGACTTAGAT GACAGAT GGCAAT T CAAGAGATC TCGGT TGTTA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGAT GAAGAAAT T GAAAAGATGAAGGGT T T TGGTGAATAT TCACGGTCT C 
CTACATTT TGATCC TT TTAACC TTACAAGGAGAT TT T TT TATTTGGCT GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACTCTTAGTGCATTTACT TCACAAAGTACTTTT TCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
T AGGGAAG ACAAG T CAGGAGGAT T GAT T GAAGC TAGG AG T T AGAGAC CAGCCTGGGCAACGT AT T 
GAGACC AT G TC TAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCCT TATT T T CAAAATATGGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CATTAT GGCTAGAGT TGCCAGATAAAAT GCT GGATATCAT GCAATAAAT TT GCAAAACAT CATC T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 22 



■■ -.1 

Ill 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSNVKKNVVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPS 1 1 TESCS THRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKNI QKDPQENI FLCQALRT FFPNSE FLHSCYMSLKNRHVS KS S CNYNHHLDWDNLTL 
MVEHTDI PEAS PAS TPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 
amino acids 1-19 



N-glycosylation sites. 

-I 

amino acids 75-79, 322-326 

HI N-myristoylation site. 

%sl amino acids 184-154 



Growth factor and cytokines receptors family. 

Ill 

amino acids 134-150 
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FIGURE 2ft 



GG CACAGCC GCG C GGCGGAGGGCAGAGT CAGC CGAGCCGAGT C CAGCC GGACGAGCGGACCAGCGCAG GGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
? . AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
i*% CAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGATTTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTG 
CAAGCTGT CTTT GCTT CACGTT ACAAGAAGTTT GAT GAAT T CTT CAAAGAACT ACTTGAAAAT G CAGAGAAAT C CCT G 

u 

y ^ AAT GAT AT GTT T GT GAAGACATAT GGC CATTTATACAT GCAAAATT CT GAG CTATT TAAAGAT C T CTT CGTAGAGTT G 

r i AAACGTTACTACGT GGT GGGAAAT GT GAACCT GGAAGAAAT G CT AAAT GACT T CT GGGCT CG C CT CC TGGAGCGGAT G 

TTCCGCCTGGTGAACTCCCIAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

MJ 

% s TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
AT GAT CTACT GCT CC CAC T GCC GGGGT CT C GT GACT GT GAAGC CAT GTTACAACTACT GCT C AAACAT CAT GAGAGGC 

3D 

#«* TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
1 1\ GAGGGTCCTTT CAACATT GAAT CGGT CATGGAT CCCATCGAT GTGAAGATTTCTGATGCTATTATGAACAT GCAGGAT 

pi AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
Ips TC CAT CT CTGAAAGT G C CTT CAGTGCT CGCTT CAGACCACAT CACC CC GAGGAACG CCCAAC CACAGCAGCT GGCACT 

AGTTT GGACCGACTGGT TACT GATGT CAAGGAGAAACT GAAACAGGC CAAGAAATT CT GGTCCTCCCT T CC GAGCAAC 
iff s GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CTGTTT G CAGT GACAGGAAAT GGAT TAGCCAACCAGGGCAACAACCCAGAGGT CCAGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCT T CGT CAAAT CATGGCTCTT C GAGT GATGACCAGCAAGAT GAAGAATGCAT ACAAT GGGAAC GACGT G 
GACTT CTTTGATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GGCT GTGAGTAT CAGCAGT GCCCTTCAGAG 
T T T GACTACAAT GCCACT GACCATGCT GGGAAGAGT GC CAAT GAGAAAGC CGACAGTGCT GGT GT CC GTC CT GGGGCA 
CAGGC CTACCT C CTCACT GT CTT CT GCATCT T GTT C CT GGTTATGCAGAGAGAGT GGAGATAATT CT CAAACT CT GAG 
AAAAAGT GTT CAT CAAAAAGT TAAAAGGCACCAGT TAT CACT TTT CTACCAT C CTAGT GACT TTGCTTTTT AAAT GAA 
TGGAC^C^ATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCATTGAGTT GGTTC CT G CT C C CCCAAACCAT GTTAAACGT G GCTAACAGT GTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAATATTAAATAGCT GT ACAGAAGC AGGT T T T ATT TAT CAT GTTAT CTTAT TAAAAGAAAAAG CCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSETO 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWG1WNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSN IMRGCLANQGDLDFEW3SINFI DAMLMVAERLEGP FNIE SVMDP I DVKI SDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLT VFC I LFLVMQREWR 

Important features : 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins . 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2S 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C T GAG T CAT C C C CAGGGAT CAGGAGC CT C C AG CAGG GAAC CT T C CAT TAT AT T C T T CAAG C AAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAAT TCAT GACAG TGTC TGGGCT GCCAAAGAAGCAGT GCCCC TGT GAT CAT T 
TCAAGGGCAAT GTGAAGAAAACAAGACAC CAAAGGCACCACAGAAAGCCAAACAAGCAT T CCAGA 
H GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
'Hi AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
SI TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
1 s 1 TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
IJl GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGT GTCAT T TAACCT TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAATAT T TT T 
AAATGTCAAAAAAAAAAAAAAAAAA 

ill 

; :: 

iy 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 



H N-myristoylation sites. 

amino acids 27-33, 46-52 

Q 

m 
ui 
m 



Q 

m 
o 
la 
o 
III 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGA.CATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCGT GCTGAC TCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGC TCT T CAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
|4 GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
13 CTCTGTGCAGAGCCCAAGGAAGGTCCTG.CAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
rJ TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

m 

GAGC C CAC C T GGAAACACAT T GGGGAT G G C T GC TGC C T C AC CAGAGAGAC C T GGAAGGAT C T T GA 
GAACGCCCAGTTCTCCGT^AATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GC AGAAT GAGAGAAGACAT T CATGTACCACCTACTAGT CC CT CT CTCCCCAACC T CTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC TCAAAGAGGCAGAGGT T T TGT T CT CAAATATT T T TT AATAAATAGACGAA 
ACCACG 



- 



til 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHl^LLGCWQPLCKSYFPYimVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQI^GSMDVWCTLVLCSVQSPRKVLQETORVLRPGGVLFFWEHYAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 

*** Signal peptide: 

P| 

« 4 amino acids 1-23 

in 

|l| Leucine zipper pattern. 

Ui amino acids 10-32 

N-myristoylation sites. 

fi amino acids 64-70, 78-84, 80-86, 91-97, 201-207 

m 

■7. ' S 

CJ 

w 

flj 
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FIGURE 2Q 



m 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGG7VACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGT TGT T TATGAT TCT TAT AC TAAT T TATACAAAGATATTAAGGCCC T G T TCAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAGC TGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 



MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 



amino acids 3-18 



M Growth factor and cytokines receptors family: 

ill 

Q 

m 
cii 
vi 
* 

* 

o 
w 

13 
W 

c:i 
in 
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FIGURE 31 

GT T T GAATT CC TT CAACTAT ACCCACAGT CCAAAAGCAGACT CAC T GTGT CCCAGGCTACCAGT T 
CCTC CAAGCAAGTCAT T T CCC T TAT T TAACCGATG TGT CCCT CAAACACC T GAGTGCTAC T CCC T 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGGCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAATCAT T GT CAT GTACATGCAAAACGCACTGAAAGAACAGCAGCATGG T GCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAAC TGCTAT TAATGGGACAGATT TCTGTACATCAGCAAAAGATGC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GATGGAT CGTCAGAAAAGC CC TACT T TAT GGAT CAAGAAT T TC TGAGT T T CGTA 
AAAAGGAGCAAC AAAT TAAACAATGCAAGGGCACAGCAGGACAAGCACTCAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCTGGAAAACATT 
T C C T T C T AAG AG C CAT T T AC AGAAT AG AAG AT GAG AC C AC T AG AG AAAAG T TAG T G AAT T T T T T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAI VS TG I TAVLLVLI FVLRKRI KLTVELFQ I TNKAI S SAP FLLFQPLWT FA 
ILIFFWLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALPCEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWC FTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLS VFETVLDALFLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTT ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT GTCAAAGCCGGAAAGAT CATCGAT CC TGAGT T CAT T GTGAAATGTCCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TATT CCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TAT CGT C T TAGAAAGTAAACCCAAAAAGGGTGTAAC CTACCCAT CAGC T C T TACATACTCAT CAT 
s : CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
O CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
O GCCACAGGAGCCAGGAGAT GGATCT C TGGT CCACT GCCACC TACACAAGCAGCCAAAACAGGCCC 

ff| AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
7 1 GGATGT CAGCC T GGGACT T GT T CCAAAAGAAGAAT TGAGCACACAGT CT T T GGAGCCAGTAT CCC 

ft TGGGAGAT CCAAACT GCAAAAT TGAC TTGTCGTTTT T AATTGATGGGAGCACCAGCAT TGGCAAA 

Ui CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
i[| CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
\f ACACGAATTCTCGAGATCT GAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTTTCTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
!L GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

0 GAC T T GCGAGAGAGT CAGGAAT CAACAT TT TC T TCATCACCAT TGAAGGT GCTGC TGAAAAT GAG 
|fl AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
PI GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

1 „ | ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
Q AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
f|| TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGT^A 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCAT TGCCACT CACCCCGCCAGAGACCACT CCT TC T T TGT GGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 
AGCC T CGGAAC TGAA T T CAGAGCAGGCAGAGCACCAGCAAGTGC TGCT TTAC TAAC TGACG T GT T 
GGAC C AC C C C AC CGC T T AAT G GG GCAC GCACGG T GCAT CAAGT C T T G G GC AG GGCAT GGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGAT CACAAACGTATAGAAT GAGCCAAAAGGC TACAT CAT GT T GAGGGTGC TGGAGAT T T TACAT 
T T T GACAAT T G T TT T CAAAATAAAT G T T C GGAAT AC AG T G C AG C C C T T AC GACAGGC T T AC G TAG 
AGCT T T T GT GAGAT T T TT AAGT TGT TAT T T CT GAT T T GAACTC TGTAACCC T CAGCAAG TT T CAT 
T T TT GTCAT GACAATGTAGGAAT TGC T GAAT TAAAT G T T TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAA?^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVS LGLVP KEELS TQSLEPVS LGDPNCKI DLS FL I DGST S IGKRRFRI QKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMIL I TDGRS YDDVRI PAMAAHLKGVI TYAIGVAWAAQEELEVIATHPARDHS FF 
VDE FDNLHQYVPRI I QNI CTE FNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACC AT GTCT TT T TTT CT GT TT TC AGAGTAGT TC AC AACAGAT CTGAGT GTT TT AATTAAGC ATGGAAT 
ACAGAAAAC AAC AAAAAACT T AAGCT T T AATT TC AT CT GG AATT CC AC AGTT TTCT T AGCTCCCT GGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATG GCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT C AGC CT T CCC CACT AC AAT GTGAT AGAACG CGT G AACT GGAT GT ACTT CT AT GAGT AT GAGCCGAT TT A 
CAGACAAGACTT TC ACTT CACACT TCGAGAGC ATT CAAACTGCT CT C ATCAAAATCC ATT TCTG GTC ATT C 
pj TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
Q TGGTGGGG AT AT GAGGTT CT T ACATT TT TCTT ATT AGGCC AAGAGGCT GAAAAGGAAGACAAAATGTT GGC 

Cf! ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
i * J ACCTGACC TTGAAAACCATT AT GGC AT TCAGGTGGGTAACT GAGTT TT GCCCCAAT GC C AAGT ACGTAAT G 

- * AAGACAGACACT GATGT T TT CATC AAT ACT GGC AAT TT AGT G AAGT AT CT T T T AAACCT AAACC ACTC AGA 

GAAGTT TTT C ACAGGT TATC CT CT AATTGATAATT ATT CCT AT AGAGGAT TT T ACC AAAAAACCCAT ATT T 
CTT ACC AGGAGT AT CCTT TC AAGGTGTT CC CT CCAT ACTGCAGT GGGTTGGGT T AT AT AAT G TCCAGAGAT 
TT GGTG CC AAGGAT CT AT GAAATG AT GGGT CACGT AAAACCC AT CAAGTT T GAAGATGT T T AT GT CGGG AT 

W 

1 1| CT GT T T GAAT T TAT T AAAAGT G AACATTC ATATTCC AGAAGACAC AAATCTT TT CT TT CT AT AT AGAAT C C 

f"1 AT TT GGAT GT CT GT CAACTGAGAC GT GT GATT GCAGCCCATGGCT TTT CT TCCAAGGAGATC AT CACT TT T 

III TGGC AGGT CATGCT AAGGAACACC AC ATGCCAT T AT TAACT T C ACAT T CT AC AAAAAGCCT AGAAGGACAG 

Q GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAZ^AATTCATGGGGAGGTCAGTGTGCTGGCTT 
I'll ACAC TG AACT GAAACTCATGAAAAAC C CAGACT GGAGACTGG AGGGT T AC ACT T GT GAT T TATT AGT CAGG 

CCCT TC AAAG AT GAT ATGTGGAGG AAT TAAAT AT AAAGGAAT TGGAGGTT TTT GCT AAAG AAATT AAT AGG 
ACC AAAC AAT TT GGAC AT GT CATT CT GTAGACT AGAAT TTCT TAAAAGGGT GTT ACT GAGTT AT AAGCT CA 
CTAGGC TGTAAAAAC AAAACAATGTAGAGT TT TATT TATT GAACAATGTAGTCACT T G AAGGT TTT GT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATT ATT T AAAAT TACT TC AAC T TT GT GT T T TT AAAT GT TTT G ACGAT T TC AAT ACAAG AT AAAAAGGAT AG 
TGAATCAT TCT TTACATGCAAACATT TT CC AGTT AC TT AACT GATCAGTT TAT TAT TG AT ACAT CACT CCA 
TT AATG TAAAGT CAT AGGTC AT TATT GC AT AT CAGT AAT CTCTT GGACTT TGT TAAAT ATTT T ACT GT GGT 
AAT AT AGAGAAGAATTAAAGCAAG AAAAT C TG AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSL^^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 



Important features: 

Type II transmembrane domain: 

amino acids 20-39 



Ifl N-glycosylation sites. 



amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 



||| L y~ 6 / u-PAR domain proteins. 

amino acids 23-37 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGC T CGGGC ACCAGCCGCGGCAAGGATGGAGC TGGGT TGC TGGAC GC AGTTGGGGCTCAC T TT TC TTC AGCT C CT TC TC AT C 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGT GACT C CTGCC TGAT CCACCC AGGT T GTACC ATC TT TGAAAAC TGCAAGAGC TGC CGAAATGGCTC ATGGGGGGGT 
ACC T TGGATGAC TTC TATGTGAAGGGGT TCTAC TG TGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGAC TGCATGCGATGT G 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGT T TGTCATCCAAC T AAGAT TT GT CATGT TGAGTC TGGAGTT TGAC TACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGATGGAGACAACCGCGATGGCC AGAT C ATCAAGC GTGTC TGTGGCAACGAGCGGC CAGC TC C TATC C AGAGCATAG 
GATCC TCAC TC CACGTCCTCT TCCACTCCGAT GGC T C CAAGAAT T TTGACGGT TTC CATGCCAT T TAT GAGGAGATC ACAGC 
H ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
Q TAT AC TGGGCAGCGC T GTGAAAATC TCC T TGAAGAAAGAAACTGC TCAGAC C C TGGGGGCCCAGTCAATGGGTACCAGAAAA 

flf TAACAGGGGGC CCTGGGCT TATCAACGGACGC CATGC TAAAAT TGGCACCGT GGTGTC T TTCT T T TGTAACAACTCC TATGT 

|f| TC T TAGTGGCAATGAGAAAAGAAC TTGCC AGC AGAAT GGAGAG TGGTCAGGGAAAC AGCCCATC TGC ATAAAAGCCTGCCGA 

||| GAACCAAAGAT T TCAGACC TGGTGAGAAGGAGAGT T C T TCCGATGCAGGTT CAGTC AAGGGAGACACCATTACACC AGC TAT 

HI AC TCAGCGGCC T TCAGCAAGC AGAAAC TGCAGAGT GC CCC TACC AAGAAGC CAGC C C TTCCCT T TGGAGAT C T GC C CATGGG 

%f% ATACCAACATC T GCATACCCAGC TCCAGTATGAGT GCATC TCACCCT TC TAC C GC CGCC TGGGC AGCAGCAGGAGGACATGT 

\J C TGAGGAC TGGGAAGT GGAGTGGGC GGGC ACCATC C T GCATCCC TATCTGC GGGAAAAT TGAGAACATCACT GCTCC AAAGA 

^ CCC AAGGGT TGC GCT GGCCGTGGC AGGC AGCCATC TACAGGAGGACCAGCGGGGTGCATGACGGC AGCCTAC ACAAGGGAGC 

G TGGT TCC TAGT CTGCAGCGGTGCCCTGGTGAATGAGCGCAC TGTGGTGGT GGCTGCCC AC TGTGTTAC TGAC C TGGGGAAG 
GTCACCATGAT C AAGACAGCAGACC TGAAAGT TGT T T T GGGGAAATTCTAC C GGGATGATGAC CGGGATGAGAAGACCATC C 
AGAGCCTACAGAT TT C T GC TATC ATTC TGCAT CCCAAC TATGACCCC ATCC T GCT TGATGC TGACATCGCCATCCTGAAGC T 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCCCACATC ACT GTGGC TGGC TGGAATGTCC TGGCAGAC GTGAGGAGCCCT GGCT TCAAGAACGACAC ACTGC GC TC TGGGG 
TGGTCAGTGTGGTGGAC T CGC TGC TGTGTGAGGAGCAGCAT GAGGACC ATGGCAT C CCAGT GAGTGTC ACTGATAACATGT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
C GAGCATC TCC TGAGC C ACGC TGGCATC TGAT GGGAC TGGTCAGC TGGAGC TATGATAAAACATGCAGCCACAGGCT C TCC A 
C TGCC TTC ACC AAGGT GC TGC C T T T TAAAGAC TGGAT TGAAAGAAAT ATGAA AT GAA CC ATGC TC ATGCAC TCCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
G TGCC AGGGCT TC TGAC T TCAGGGACAAAAC T C AGT GAAGGGTGAGT AGAC C T CC AT TGC TGGTAGGC TGATGCCGCGTCCA 
C TACTAGGACAGCCAAT TGGAAGATGCC AGGGC TT GCAAGAAG TAAGTT TC T T CAAAGAAGACCATATACAAAACCTCTCCA 
C TCCACTGACC TGGT GGTC TTCCCCAAC T TTCAGT TATACGAATGCCATCAGC TTGACCAGGGAAGATC TGGGC T TC ATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
G AAC T AC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT IHAKPG FVI QLRFVMLS LE FDYMCQYDYVEVRDGDNRDGQ I 1 
KRVCGNERPAPIQS I GSSLHVLFHSDGSKNFDGFHAI YEE I TACSS S PCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKI TGGPGL INGRHAKI GTWS FFCNKS YVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAI YRRT S GVHDGSLHKGAW FLVC S GALVNERT VWAAHCVT DLGKVTM I KTADL 
KWLGKFYRDDDRDEKT I QSLQ I SAI I LHPN YDP I LLDAD I AI LKLLDKAR I S TRVQP I CLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
T GAAGGTT CT CAGTCAAAT T CTTT GT GAT CTACT GATT GT GGGGGCAT GGCAAGGT TT G CTTAAAGGAGCTTG GCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
T GGT CTTGCCTT GGCTCAGT CCT GCTAACTACATT GACAAT GT GGGCAAC CTGCACTT CCT GTATT CAGAACTCT GTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T GACAGCCACGGCTC CCT C C CCAGAG GTTTCTGCAGCT GCCACCAT CT C CTTAATGACAGACGAGCCT GG CCTAGACA 
ACC CT GCCTACGT GT CCTC GGCAGAG GACGGGCAG CCAGCAAT CAGCC CAGTGGACTCT GGCCGGAGCAACCGAACTA 
GGGCACGGCC CTT T GAGAGAT CCACTAT TAGAAGCAGAT CATTTAAAAAAATAAAT CGAGC TTT GAGT GTT CTT C GAA 
GGACAAAGAGCGGGAGTGCAGTT GCCAAC CATGC CGACCAGGGCAG GGAAAATT CT GAAAACACCACT GC C CCT GAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATC^AGATCAATCGAGTAGATCCCAGTGAAA 
GCCTCT CTAT TAGGCT GGTGGGAGGT AGCGAAACCC CACT GGT CCATAT CATTAT CCAACACATTTAT CGT GAT GGGG 
TGAT CGCCAGAGACGG CCGGCTAC T GCCAGGAGACATCATT CTAAAGGT CAACGGGAT GGACATC AGCAAT GT CCCTC 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATGGACAGGCC CCGGAT GC C TACAGAC CCC GAGAT GACAGCTTT CAT GT GATTCT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCT^ATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGC CCT GACAT CTTTCAGGAAGCCGGCT GGAACAGCAAT GGCAGCT GGT CC CCAGGGCCAGGGGAGAGGAG C AACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCATGACCGT C GCAGGGGGAG CAT CACATAGAGAAT GG GATTT GCCTAT CTAT GTCAT CAGT GTT GAG CCCGGAGGAG 
T CATAAGCAGAGATGGAAGAATAAAAACAG GTGACATT TTGTT GAAT GT GGAT GGGGTCGAACTGACAGAGGT CAGCC 
GGAGT GAGGCAGT GGCATTATT GAAAAGAACAT CATCCTCGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
C CCAGGAAGACT GCAGCAGC CCAGCAGCCCT GGACT CCAACCACAACATGGCC CCACC CAGT GACT GGT C CCCAT CCT 
GGGTCAT GTGGC TGGAAT TACCAC GGT GCTT GTATAACTGTAAAGATAT T GTAT TACGAAGAAACACAGCT GGAAGTC 
T GGGC T T CTGCAT TGTAGGAGGTTAT GAAGAATACAAT GGAAACAAACCTT TTT T CAT CAAAT CCATT GTT GAAGGAA 
CAC CAGCATACAAT GAT GGAAGAATTAGAT GT GGT GATATT CTTCTTGCT GTCAAT GGTAGAAGTACATCAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTT TATAGAAT CAAT GAT GGGTCAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACT ATAT TTAT C 
TTGTCAGTTTT TATATTTAAAGAAAGAATACATTGTAAAAAT GT CAGGAAAAGTAT GAT CAT CTAATGAAAGCCAGT T 
ACACCT CAGAAAATAT GATT CCAAAAAAAT TAAAACTACT AGTTTTT TT T CAGTGT GGAGGATTT CT CATTACT CTAC 
AACATT GT TTATATTTTT TCTATTCAATAAAAAGCCCTAAAACAACTAAAAT GATT GATTT GTATACC CCACT GAATT 
CAAGCT GATTTAAAT T TAAAAT TTG GTATAT GCTGAAGTCT GCCAAGGGTACATTATGGC CAT T TT TAATT TACAGCT 
AAAAT ATTTTTTAAAAT GCATT GCT GAGAAACGTT GCTTT CAT CAAACAAGAATAAATATT TTTCAGAAGTTAAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSSLCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDEPGLDNPAYVS SAE DGQPAI S PVDSGRSNRTRARP FERS T IRSRS FKKI NR 
ALS VLRRTKSGSAVANHADQGRENS ENT TAPE VFPRL YHL I PDGE ITS I KINRVDPSE SLS IRL V 
GGSETPLWIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEWEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
R IRCGDI LLAVNGRS T SGMI HACLARLLKELKGRI TLT I VS WPGT FL 

Important features : 
Signal peptide: 

amino acids 1-15 

N -glycosylate on sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myr i s toyl a ti on s i tes . 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGT ATCTTCAGTTGTCATCAAGT T CGCAAT CAGATTGGAAAAGCTCAACTT GAAGCTT T 
CTTGCCT GCAGT GAAGCAGAGAGATAGATATT ATTCACGTAATAAAAAAC ATG GGCT TCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCT CATTT TGGGGAAGGGAAAAACTCTGACTAATGAAGCATC CACGAAGAAGGT AGAACTT 
GACAACTGTCCTT CT GTGT CT CCTTACCT CAGAGGCCAGAGCAAGCTCATT TT CAAACCAGATCT CAC 
TTT GGAAGAGGTACAGGCAGAAAAT CCCAAAGT GT C CAGAGGCCGGTAT CGCCCT CAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CAT C T GC AT CC C T TC C T GC AGAGGCAGCAGC T G GAT T AT GGCAT C TAC GTCAT C CAC CAG GCT G AAG G 
TAAAAAGTTTAAT CGAGCCAAACTCTTGAAT GTGGGCTAT C TAGAAGCCCT CAAGGAAGAAAAT TGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CAT C CCAAGCAT CTGGTGGTT GG CAG GAACAGCAC T GGGT ACAGGT T AC GT TAC AGT GGAT AT T T TGG 
GGGT GTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAAT GGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
C CT GAAGT GG G T AAAT AT AC AAT GG T C T T C CAC AC T AGAG ACAAAGG C AAT G AGG T GAAC G CAGAAC G 
GAT GAAGCTCTTACACCAAGTGT CACGAGTCT GGAGAACAGAT GGGT TGAGTAGTT GTT CTTAT AAAT 
T AGT AT C T GT GGAAC AC AAT C CT T TAT AT AT C AACAT CAC AGT GGAT TTCTGGTTTGGTG CATGACC C 
T GGATCT TTTGGTGATGTT TGGAAGAACTGAT TCTT TGTT TGCAATAATTT T GGCCTAGAGACT TCAA 
ATAGTAGCACACATTAAGAACCT GT TACAGCT CAT T GTTGAGCTGAATT TT T CCTT TTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGT TGTAACAAGACAGCT TTCTTAGTCATTTTGAT 
CAT GAGG GT TAAATAT T GTAATAT GGATACTTGAAGGACT TTAT ATAAAAG GATGAC TCAAAGGAT AA 
AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGTT T TTGTAAAATGATTT TGTACAAGT AGGATAT GAAT TAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATAT GTCTAT CAAATACCTCT GTAGT AAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MG FNLT FHLS YKFRLLLLLT LCLT WGWAT SNYFVGAI QE I PKAKE FMANFHKTL I LGKGKTLTN 
EAS TKKVELDNCPSVSPYLRGQSKL I FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEE3STWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE Aft 



GCTCAAGACCCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCACT GAGCTCCCAGAT CT GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCT GCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGC TGGAT GCCCAT 

GTTCCAGAGGCGAAGGAGGGGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAPAAAAAAAAA 
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FIGURE AX 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 



f*\ cAMP- and cGMP-dependent protein kinase phosphorylation site. 

fee* 

O amino acids 58-59 



N-myristoylation site. 

amino acids 44-50 

a Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 

Iff 

1=1 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GAC C TT CAACAC AACC C C T C T T GT CAC CAT AC AGC CAGAAG GGGGC AC TAT C ATAGT GAC CCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEE DVI YTWKALGQAANESHNGS I LP I S WRWGES DMT FI C 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLF^GLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGC6TTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGT C T GATGGCCAT T C CAGCAACAACAAT GT CC T T GACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
AC AC C AT GG C GAG T GG C T GGAGAG CAT C TAG T T T C C AC T T C GAT T C T GAAGAAAACAAAC AT AGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T GTAGT T T AATGGGAATAAAAT GTAAGTAT CAGTAGT T T GAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACCT GCAGAGT CACACAGT TGCT GACCAATTGAGC T GT GAGC C TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
C AGC CAGAAGC AGCAC AG T C C T G TAC C T GAGAAG GC C AT C CC AC TC AT CAC T C C AGG C T C T GC CA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTIALLLLAGLTALEANDPFAN^^ 
SQKQHSPVPEKAIPLITPGSATTC 



Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

z""s 

::f amino acids 36-59 



N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 



GT GGACT CT GAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC TGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC T GGT GGGAGGAAGACACT CT GGAGAGAGAGGGGGCT GGGCAGAGATG?\AGT TCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
l, k GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
|!| AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
O TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
f 1 GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
|p CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
*|| GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
%!■ GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
* AGCCAGCAACGAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
^ ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
%l AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
y TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
C| CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
W CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CT TCAGAGGACAGGGAGTT TCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT TTTAAATCCAAGCTGGGT TTCATCAACTGGGAT GCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACC TCCAGACAAGGAGCCAC.CAGATTGGAT GGGAGCCCCCACACTCCCTCCT TAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



o 

in 



D 



MKFQGPIACLLIJ^CLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVT? 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSM4REISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRS SRI P 

Signal peptide: 

amino acids 1-21 



/i% N-glycosylation site. 

\f amino acids 2 65-2 69 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



rp-x Casein kinase II phosphorylation site. 

fi'l amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTAGGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
!^ TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
Q AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
O GAAGTCCTATATAACGATCTTCAACAAGAGT GCAAACATCATGCT TGACAAGTGGCAGCACCT GG 

f I CC T CAGAGGGCAGCAGT CGT C TGGACAT GT TTGAGCACAT CAGCC T CAT GACCT TGGACAGTC TA 

jj g s 

?P CAGAAAT GCATC T T CAGCT T T GACAGCCATTGTCAGGAGAGGCCCAGT GAAT ATAT TGCCACCAT 

■}i CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 

%J ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

5 GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGC CAAGTCCAAGAC T T T GGAT T TCAT T GAT GTGC T T C T GCTGAGCAAGGATGAAGATGGGAAGG 

|?| CAT TGT CAGAT GAGGAT AT AAGAGCAGAGGCT GACACCT T CATGTT T GGAGGCCAT GACACCACG 

fsj GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

O ACAGGAGGTGCAAGAGCT T CT GAAGGACCGCGAT CC TAAAGAGATT GAAT GGGAC GACCT GGCCC 

m 

i& AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGG.TGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MS LLSLPWLGLRPVAMS PWLLLLLWGS WLLAR I LAWT YAFYNNCRRLQC FPQPPKRNWFWGHLG 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKWIJU^LL^ 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

AT CGCAT CAAT TGGGAGTACCATCT TCCTCATGGGACCAGTGAAACAGCT GAAGCGAATGT T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGT T T TATGAAGC TT T GGAAGGCACT AT GGACAGAAGC TGGTGGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
H GT GATT C GAACAT T T GAGGGT TAC T T TT GGAAGCAACAATACAT T C TCGAACCTGAATGT CAG TA 

D GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCT GGAT GT TG TC CCACTGAAT TCCCAT GAATACAAAC CTAT TCAGCAACAGCAAAAAAAAAAAA 

m 

AAAAAAAAAAAAAAMAAAAAAAAAAAAA?U\AAAAAAAAAAAAAA 

%M 
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FIGURE 56 

MG P VKQLKRM FE P T RL I AT IMVL LC FAL T LC S AFWWHNKGLAL I FC I L Q S LAL T W Y S L S F I P FAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 52 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATG CTCTGTCTGTGCCTG 
TACGT GCCGGTCAT C GGGGAAGCC CAGACC GAGTT CCAGTACTT T GAGT CGAAGGGGCT C CCT G C CGAGCTGAAGTC C 
AT TTT CAAG CTCAGT GT CTT CAT C CCCT C CCAGGAATTCT CCAC CTAC CGCCAGT GGAAGCAGAAAATTGTACAAGCT 
GGAGAT AAGGAC CT T GAT GG GCAGCTAGACTT T GAAGAAT TT GT CCATTATCTCCAAGAT CAT GAGAAGAAG CTGAGG 
CT GGT GTTTAAGAT TTT GGACAAAAAGAAT GAT G GACGCATT GACGCGCAG GAGATCAT GCAGT C CCT GCGG GACTT G 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAAT TCT CAAGAGCAT GGATAAAAACG GCAC GAT GACCATC GACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCT GGTGGCAGGAGGT GGGGCAGGGGCCGTATCCAGAACCT GCACGGCCCCCCTGGACAGGCT CAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
T CACT CT GGC GGGGCAAT G GCAT CAACGT CCTCAAAAT T GCCC CCGAAT CAGC CAT CAAATT CAT GGCCTAT GAG CAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT CGCCCAGAGCAG CAT CTACC CAATGGAGGT CCT GAAGACCC GGAT GGCGCT GC G GAAGACAGGCCAGTACT CAGGA 
AT GCT GGACT GC GC CAG GAGGAT CCT GGCCAGAGAGGGGGTGGCC GCCTT CT ACAAAGGCTAT GT CCCCAACAT GCT G 
GGCAT CAT CC CCTAT GC CGGCAT CGAC CT T GCAGT C TACGAGAC GCT CAAGAAT GC CT GGCT GCAGCACTAT GCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCC CT GGCC CTAGT CAGGACCCGGATG CAGGCGCAAGCCT CT AT T GAGGGCGCT C CGGAGGT GAC CAT GAGCAGC CT C 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCT GT GAGCAT CAGCTACGT GGT CTAC GAGAACCT GAAGAT CACCCT GGGC GT GCAGT C GCG G TGA C GGGGG GAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAG CT GT CT C GAGCCAAGCT GT GAAAACCC TAGACGCACCCG CAGG GAGG GT GGGGAGAGCT GGCAGGCCCAGGGCTT 
GT C CT GCT GACC CCAGCAGACCCTCCT GTT GGTTCCAGCGAAGACCACAGGCATT CCTTAGGGTC CAGGGT CAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGC CT GGACC CT GT CAGGAT GGGCCCCACCT CAGAACCAAACT CACT GTC C CCACTGT GGCAT GAGG GCAGT GGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GTT GGGAAAAGGGTT T T GT CCAGAAGGACAAG CCGGACAAAT GAGCGACT TCT GT GCT T CCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGAC CAGCC CCACATT CCACTT GTGTCACT GCTT GGAACCTATT TATTTT GTATT TATT TGAACAGAGTTAT GT CCT 
AACTATTTTTATAGAT TT GTT TAATTAATAGCT T GT CATT TT CAAGTT CATT TTTT ATTCATATTTAT GTT CATGGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAG GAAAT CTT C AAT AGG AT G CAAAGAT CAAT G CAAAAATT GT TAT AT AT GAAC ATAT AAC T G GAGT C GT C AAAAAG 
CAAATTAAGAAAGAATTGGACGTTAGAAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS I FKLSVFI PSQEFST YRQWKQKI VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVP3SIMLGI I PYAGIDIAWETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSS L FKHI LRTEGAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE gQ 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGAT GGAATCCT GAGC T GCACT T T T GAACCT GACAT CAAACT T TCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
%~k GGCAAT GCCT C TT T GCGGC T GAAAAACGT GCAACTCACAGAT GCTGGCACC TACAAAT GT TATAT 

13 CAT CACT T CTAAAGGCAAGGGGAAT GCTAACCT TGAG TATAAAA.C TGGAGC CT TCAGCAT GCCGG 

Jj{ AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
ZI CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
Ifl CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
Ul TCAACAACACATAC T CC T GT ATGAT T GAAAAT GACAT T GCC AAAGCAACAGGGGATAT CAAAGTG 

^ ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
* CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
In GTGCC TT GGCCACAAAAAAGCAT GCAAAGTCAT TGT T ACAACAGGGAT CT AC AGAACTAT T T CAC 

CAC CAGATAT GACC TAGT T T TATAT T TCT GGGAGGAAAT GAAT T CATATCTAGAAGTC T GGAGT G 
111 AGCAZ\ACAAGAGCAAGAAA.CAA?UyiGAAGCCAAAAGCAGAAGGC TCCAATAT GAACAAGATAAAT 

W C TATCT T CAAAGACATAT TAGAAGT T GGGAAAATAAT TCATGT GAAC TAGACAAGTGT G T TAAGA 

HI 

GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAAT GTTGC TCT GAGGAAGC C CC T GGAAAG T C TAT C C C AAC AT AT C CAC AT C T TATAT T CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CT TCCCAACT GACAAAT GCCAAAGT T GAGAAAAATGAT CATAAT T T TAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAT^AA^^AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MAS LGQILFWSIISIIII LAGAI AL I I G FG I S GRH S I T VT T VAS AGN IGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI SWALLPLS PYLMLK 



Important features : 
Signal peptide: 

amino acids 1-28 



JZ Transmembrane domain: 

m 

: .n amino acids 258-281 



N-glycosylation sites . 



amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 



I'll amino acids 52-58, 126-132, 188-194 



APP ID=10063597 



Page 210 of 318 



FIGURE 61 



'i 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCGCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGACTCAGAT CACAGTGGC TAT AT CT CCATGAAGGAGCTAAAGCAGGCCC TGGTCAAC TGCA 
ATT GGTCT T CAT TCAAT GAT GAGACC T GCCT CATGAT GATAAACATGTT TGACAAGACCAAGT CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
\| GGCC TT CCGGGAGAAGGACACAGC TGTACAAGGCAACATCCGGC TCAGC T T CGAGGAC TT CGT CA 

3 CCATGACAGCT TCT CGGATGC TATGACCCAACCATC TGT GGAGAGTGGAGT GCACCAGGGACCT T 

^ TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

III 

AT TCTCCCTTGCT TGAT GC AACAC TGT T CCAAAAGAGGG T GGAGAGT CCT GCATCATAGC CACCA 

%J 

l , 1 AATAGTGAGGACCGGGGC T GAGGCCACACAGATAGGGGC CTGAT GGAGGAGAGGATAGAAGT T GA 

Q ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
III AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGAT C TGT C TAT GGGACCAGTGGCT TGGAT T CT GCCACACCCAT AAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T T GGGGCCAAAAGT C CAGTGAAAT TG TAAGC T TCAAT AAAAGGATGAAACT C TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMINMFDKTKSGRIDVYGFSALWKFIQ 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAA.CCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATT7VATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGC TAGA GCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGC TCAT C T CCAATAAAAT AAAAGCAC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAA?yy\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 6d 

MQGRVAGSCAPLGLLLVCLHLPGLFARS I GWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
J=f GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
£|| GGCAGT GCCCTCTC TGGGCAGTCAGAT C CACCCAGTGCT TAATAGCAGGGAAGAAGGTAC T TCAA 

W AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAGT GAGATG TAAAAAAAAAAAAAAAAAA 

b 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQI QTTSANENSTVLPSS TS SS SDGNLRPEAI TAI I WFS 
LIJ^UjLLAVGIx^LVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

! amino acids 56-80 



N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 



N-myristoylation sites. 

amino acids 7-13, 2 6-32 
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FIGURE 6q 

G CCAGGAATAACTAGAGAGGAACAAIGGGGTT AT T CAGAG GT TTT GTTT T CCT CTTAGT T CT GT GCCTGCT GCAC CAG 
T CAAATACT TCCTT CAT T AAGCT GAAT AATAATGGCT TT GAAGATAT TGT CATT GT TATAGATC CTAGT GTGCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGAT AT GGT GACTACAGCT T CTACGTACCT GTT T GAAGCCACAGAAAAAAGA 
T T TT TTTT CAAAAAT GTAT CTATATTAATT C C TGAGAATT GGAAGGAAAAT CCT CAGTACAAAAGGC CAAAACAT GAA 
AACCATAAACAT GCT GATGTT ATAGTT GCAC CAC CTACACTCCCAG GTAGAGAT GAACCAT ACACCAAGCAGTT CACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCT GACCTTCTACTTGGAAAAAAACAAAAT GAATATGGACCA 
C CAGGCAAACT GTT T GT CCAT GAGT GGGCT CACCT CC GGT GGGGAGT GT TT GAT GAGTACAATGAAGAT CAGCCT TT C 
TACCGT GCTAAGTCAAAAAAAATC GAAGCAACAAGGT GTT C CGCAGGTAT CT CT GGTAGAAATAGAGTTTAT AAGTGT 
CAAG GAGGCAG CT GT CTTAGTAGAGCATGCAGAATT GATT CTACAACAAAACT GTATGGAAAAGAT T GT CAATT CTTT 
C CTGAT AAAGTACAAACAGAAAAAGCATC CATAAT GT TT AT GCAAAGTATTGATT CTGTT GTT GAATT T T GTAACGAA 
AAAACCCAT AAT CAAGAAG CT C CAAGC CTACAAAACATAAAGTGCAATT TTAGAAGTACAT G GGAGGTGATTAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATTGT GT GCTTAGTT CTTGAT AAGT CT GGAAGCAT GGGG GGT AAGGACCGCCTAAAT CGAAT GAAT CAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCTAATC CAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT GGC AG GATTACCTACAT ATC CT CTGGGAGGA 
ACTT CCAT CT GCT CT GGAATTAAATAT GC ATTT CAGGT GATTGGAGAGCT ACATT CCCAACT CGATGGATCCGAAGTA 
CT GCT GCT GACT GAT GGGGAGGATAACACT GCAAGTT CTT GTATT GAT GAAGT GAAACAAAGT GGGGCCATT GTT CAT 
TT TATT G CT TTGGGAAGAG CTGCT GAT GAAGCAGTAATAGAGAT GAGCAAGATAACAGGAGGAAGTCATTTTTAT GT T 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
T C CCTTCAG CTCGAAAGTAAGGGATTAACACT GAATAGTAATGCCTGGAT GAACGACACT GT CATAATT GAT AGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 
TGGG CATACAAT CT T CAAGC CAAAGCGAACCCAGAAACATTAACTAT TACAGTAACTT CT CGAGCAGCAAATT CTTCT 
GT GC CT CCAATCACAGT GAATGCTAAAAT GAATAAGGAC GTAAACAGTT T C CCCAGCC CAAT GATTGT TT ACG CAGAA 
ATT CTACAAGGAT AT GT ACCT GTT CTT GGAGC CAATGT GACT G CT TT CATTGAAT CACAGAAT GGACATACAGAAGTT 
TT GGAACTT TTGGAT AAT GGT G CAGGC GCT GATTCTTT CAAGAAT GAT GGAGT CTACT CCAG GTATTT T ACAGCATAT 
ACAGAAAATGGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCA 
CT GAATAGAGCCGCGTACAT AC CAGGCT GGGTAGT GAAC GGGGAAAT T GAAGCAAACC CGC CAAGAC CTGAAATT GAT 
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
CT T CCCT T GCCT GAC CAAT AC C CAC CAAGT CAAATCACAGACCTT GAT GC CACAGTT CAT GAGGAT AAGAT TATT CTT 
ACATGGACAGCAC CAGGAGATAATTTT GAT GTT GGAAAAGTT CAAC GTTAT AT CATAAGAAT AAGT GCAAGTATT CTT 
GAT CTAAGAGACAGTT TT GAT GATGCT CTT CAAGTAAATACTACT GAT CT GT CAC CAAAGGAG GC CAACT CCAAGGAA 
AGCTTT G CATT TAAACCAGAAAATAT CT CAGAAGAAAAT GCAACCCACATAT TTAT TGCCATTAAAAGTATAGATAAA 
AGCAAT T T GACAT CAAAAGTAT C CAACATT GCACAAGTAACTTTGTTTAT C CCT CAAGCAAAT CCT GATGACATT GAT 
CCTACACCTACT C CTACT CCTACTCCTACT C CT GATAAAAGT CATAAT TCT GGAGTTAATATT TCTACGCT GGTAT T G 
T CTGT GATTGGGT CT GTT GTAAT T GTTAACT TTAT TTT AAGT ACCAC CATT TGAACCTTAAC GAAGAAAAAAATCT T C 
AAGTAGAC CTAGAAGAGAGTTT T AAAAAACAAAACAAT GT AAGT AAAGGAT ATTT CTGAAT CTTAAAATT CAT CCCAT 
GT GTGAT CAT AAACT C^TAAAAATAATTTTAAGATGT CGGAAAAGGATACTTT GATTAAATAAAAAC ACT CAT GGATA 
T GTAAAAACT GTCAAGAT TAAAATTTAATAGTTTCATT TATT T GT TATTTT ATTT GTAAGAAATAGT GAT GAACAAAG 
AT CCT TTTT CAT ACT GAT AC CT GGT T GTATATTAT TT GAT GCAACAGTTTT CT GAAAT GATATTT CAAATT G CAT CAA 
GAAATTAAAAT CATCTAT CT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAAT AAACAACATT TGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLN1MGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKGNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFWSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNA1MNDTVIIDSTVGKDTFFL 
I TWNS LPPS I S LWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANSSVPPITWAKMNKDTOSFPSPMIVYAEILQGWPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVWGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHI FIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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CT CCTTAGGT GGAAACCCT GGGAGTAGAGTACTGACAGCAAAGACC GGGAAAGACCATAC GT CCCCGGG CAGGGGTGA 
C AACAGGTGT CAT CTT TTT GAT CT C GT GT GT GGCT GC CTT CCTATTT CAAGGAAAGACGC CAAGGTAATTT T GACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACT T GCCT GAGGGT GGACCAGAAGAAAGGAAAGGT CC C CTCTT GCT GT T GGCT GCACAT CAGGAA 
G GCT GT GAT GGGAAT GAAGGT GAAAACT T GGAGATTT CACTT CAGTCATT GCTT CT GCCT GCAAGAT CAT CCTTTAAA 
AGTAGAGAAGCT GC T CT GT GT GGT GGTTAACT CCAAGAGG CAGAACT CGT T CTAGAAGGAAATGGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 

l„ L cgcaactacgtgagcagcctgaagcggcagatcgcacagctcaaggaggagctgcaggagaggagtgagcagctcagg 

j)** aatgggcagtaccaagccagcgatgctgctggcctgggtctggacaggagccccccagagaaaacccaggccgacctc 

tj ctggccttcctgcactcgcaggtggacaaggcagaggtgaatgctggcgtcaagctggccacagagtatgcagcagtg 

**l cctttcgatagctttactctacagaaggtgtaccagctggagactggccttacccgccaccccgaggagaagcctgtg 

5* aggaaggacaagcgggatgagttggtggaagccattgaatcagccttggagaccctgaacaatcctgcagagaacagc 

I? t cccaatcaccgtccttacacggcctctgatttcatagaagggatc^^ 

I 1 1 gagctcaccttcaaaggggacc^(^aacacgaattcaaacggctcatcttatttcgaccatt cagccccatcat gaaa 

T£t gtgaaaaatgaaaagctcaacatggccaacacgcttatcaatgttatcgtgcctctagcaaaaagggtggacaagttc 

II 1 . cggcagttcatgcagaatttcagggagatgtgcattgagcaggatgggagagtccatctcactgttgtttactttggg 

k f\ aaagaagaaataaatgaagtcaaaggaatacttgaaaacacttccaaagctgccaacttcaggaa 

; :'! cagctgaatggagaattttctcggggaaagggacttgatgttggagcccgcttctggaagggaagcaacgtccttctc 

' ttttt ctgtgatgtggacatctacttcacatctgaattcctcaatacgtgtaggctgaatacacagccagggaagaag 

i5 gtattttatccagttcttttcagtcagtacaatcctggcataatatacggccaccatgatgcagtccctcccttggaa 
cagcagctggtcataaagaaggaaactggattttggagagactttggatttgggatgacgtgtcagtatcggtcagac 

LI ttcatcaatataggtgggtttgatctggacatcaaaggctggggcggagaggatgtgcacctttatcgcaagtatctc 

HI cacagcaacctcatagtggtacggacgcctgtgcgaggactcttccacctctggcatgagaagcgctgcatggacgag 

i| ctgacccccgagcagtacaagatgtgcatgcagtccaaggccatgaacgaggcatcccacggccagctgggcatgctg 

%J gtgttcaggcacgagatagaggctcaccttcgcmac^gaaacagaagac^ 

f : I GAAGGAT T GT GGGAGACACTTTT T CTTT CCTTT T GCAATTACTGAAAGTGGCT GCAAC AGAGAAAAGACTTC CATAAA 

Jzl GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 

tsJ AATCAAAATCTCCGCTTTGCCTGC^W^GTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 

ffl AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATTCATGATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 

agtgagtac^ttaagtaaaataaaatggaccagaaaagaaaagaaaccataaatatcgtgtcatattttccccaa 

ttttttcccttgtgagttatagtctgcttatttaattaccactttgcaagccttacaagagagcacaagttggcctac 
attt ttatatttt ttaagaagatactt t gagat gcattat gagaact ttc agtt caaagcat caaatt gatgccatat 
ccaaggacatgccaaatgctgattctgtcaggcactgaatgtcaggcattgagacatagggaaggaatggtttgtact 
aatacagacgtacagatactttctctgaagagtattttcgaagaggagcaactgaacactggaggaaaagaaaatgac 
actttctgctttacagaaaaggaaactcattc^gactggtgatatcgtgatgtacctaaaagtcagaaaccacatttt 
ctcctcagaagtagggaccgctttcttacctgtttaaataaaccaaagtataccgtgtgaaccaaacaatctcttttc 
aaaacagggtgctcctcctggcttctggcttccataagaagaaatggagaaaaatatatatatatatatatatattgt 
gaaagatcaatccatctgccagaatctagtgggatggaagtttttgctacatgttatccaccccaggccaggtggaag 
taact gaattatttt ttaaattaagcagtt CT ACT caatcac caagat gctt ctgaaaatt gcat tttatt accattt 
c^wvctattttttaaaaataaatacagttaa 

atgcatgagctaattatctctttgagtccttgcttctgtttgctcacagtaaactcattgtttaaaagcttcaagaac 
attcaagctgttggtgtgttaaaaaatgcattgtattgatttgtactggtagtttatgaaatttaattaaaacacagg 
ccatgaatggaaggtggtattgcacagctaataaaatatgatttgtggatatgaa 
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FIGURE 72 

MMiyiVRRGLLAWISRVWLL^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AE VNAGVKLATE YAAVP FDS FTLQKVYQLE TGLTRHPEEKPVRKDKRDE LVEAI E SALE TLNNPA 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 

TLINVIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQ 

RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-my ri s toylati on si tes . 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCAT GATACCCT ACT GAACACCGAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GT CAACACTGGACGTATGAGGGCCCACATGGTCAGGACCAT T GGCCAGCC TC T TACCCT GAGT GT 
GGAAACAAT GCCCAGT CGCCCATCGAT AT T CAGACAGACAGTGTGACAT T TGACCC TGAT T T GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T GAAGC CACAT T TGCAGAGC TCCACAT T GTACAT TATGACTCTGATT CCT AT GACAGCT T GAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGGATCCTAATTGAGGTGGGTGAGACTAAG 
AAT AT AGC T TAT GAACACAT T C T GAG T CAC T T GCAT GAAG T C AG GCAT AAAGAT C AGAAG ACC T C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAAT GGAACAGCTGGAAAAGCT TCAGGGGACATTGT TCTCCACAGAAGAGGAGCCCT CTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGT CT TCACC TCAGCACAAGCCACGACT GAGGCATAAAT TCC T TC TCAGAT AC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAAT CGCTG TGT T GT TAAT GCAGAGANCAAACT CTGT TTAGT T GCAGGGGAAGT TT GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCCTTAGGATAAAGAGTT GCTGTTGAAGTTGTATAT TT TTGATCAATATATT TGGAAATTAAAG 
TTTCTGACTTT 



FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245/ 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCT GGTGGTT TGCC TAAA.CCTGCAAA.CATCACCT TCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTTC7VAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCT GTGAT GGGCTAT TC CATC TACCGATATATCCACGT T GGCAAAGAGAAACACCCAGCAAAT T T 
GAT T TTGAT T TAT GGAAAT GAATT T GACAAAAGATTCT T T GTGCCT GCT GAAAAAATCGTGAT TA 
ACT T TAT CACC CTCAATATC TCGGAT GAT TC TAAAAT TTCTCATCAGGATATGAG T T TAC TGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGTCAT TGAATAT GAATATGATGT CAGAACCACT GACAT T TGTGCGGGGCC TGAAGAGCAGGA 
GCT CAGT T TGCAGGAGGAGGTGTCCACACAAGGAACAT TAT T GGAGT CGCAGGCAGCGT T GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGAC T CGGAGGAGGGGCCGGAGGAAGAGCCAT CGAC GAC CCTGGTCGACT GGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
T GT GCAGAT GGAAAAC TGA TGCCAACACT T CCTTTTGCCTTTTGTT T CCT GT GCAAACAAGT GAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I TVFL FSVMGYS I YRY IHVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTGAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
AT C T GG C T GAAGG T CAT CACAG C T AACAT C C T C CAGC T GCAGG T GAAGC C C T C GGC C AAT GAC CA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGGAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GAT CAAT GAAAAGGCT GCAGAT AAGCT GGGAT CTACCCAGAT C GT GAAGAT C CTAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TTACACCAAAGGTGACCAACT TATACT CAAC TTGAATAACATCAGCT CT GAT CGGATC CAGC TG 
ATGAACT CT GGGAT TGGC TGGT TCCAACC T GATGTTC TGAAAAACAT CATCAC TGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWT FTLLCGLLAATL IQATLSPTAVLI LGPKVIKEKLTQELKDHNATS ILQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHI IWLKVITANILQLQWPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVP I SLS I DRLEFDLL YPAI KGDT I QLYLGAKLLDS QGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDV^ 

EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S SDRI QLMNS G I GWFQPDVLKNI I TE 1 1 HS I LLPNQNGKLRSGVPVSLVKALG 
FEAAE S SLTKDALVLT PASLWKPS S PVS Q 

Important features of the protein: 
Signal peptide: 
amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGT CAGCC TGGCAGAGAGACTCTGAAAT GAGGGAT TAGAGGTGT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGT CTGT GAAAAACAGT GGACAGCACCCCGAGGGCCACAGGT GAGGGACACT ACCACT GGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCT^AGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T T GT GCATGAAC TGAAATAAAACCAT CCTACGGTAT CCAGGGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPAD I QAAQAMMVT S S AI S S LAC 1 1 S WGMRC T VFCQE S RAKDRVAVAGGVFF I LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site, 
amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTGTCCCTCCTTTC 

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 

ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACTGCCAGAC T CCT GCTGCCAAGCC T GCAAAGATGAGGCAAGT GAGCAAT CGGAT GAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGACCCAAGGGAGCAGGCAGCACAACT GT CAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

CTCGGAC T TGG T GGAGATCTACCT CT GGAAGC TGGTAAAAGAT GAGGAAACT GAGGC T CAGAGAG 

GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 

GAAGCAAGACT T CCAGAAAGAGGCACAGCACT T CCGACTGC TC GC TGGCCCCCACGAAGG T CACT 

GGAAC GTCTT CC TAGC CCAGACCCT GGAGCTGAAGGT CACGGCCAGT CCAGACAAAGT GACCAAG 

ACATAACAAAGACC TAA.C AGTT GCAGATATGAGC TGTATAATT GT TGTTAT TATATAT TAATAAA 

TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 82 



MVPEVRVLSSLLGIJ^LWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S STRCPKAPGRVLVHT SVS PSPDNLRRFALEHEAS DLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino * acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 

GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 

TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 

TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 

GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 

CC AGT T CT CT AGCG TGGT CC ACCTCT AC AGGG ACGGGAAGGACC AGC CAT TT ATGC AG ATGCCACAGT AT C 

AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 

GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

AC AGGT GT C AGC ACT GGGCT CAGT TC CTCT CATT T C CATC ACGGGATATGTT GAT AGAGACATC CAGCTAC 

TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

AC AGACTC C AGGAC AAACAGAGAC AT GC AT GGCCT GTT T GAT GT GGAGAT CTCT CT GACCGT CC AAGAGAA 

CGCCGGGAGC AT AT CCTGTT CCAT GCGGCATGCTC ATCT GAGCC GAGAGGT GGAAT CC AGGG T AC AGATAG 

GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

TT TT T T GGCATT GT T GGACT GAAGAT TTT CTT CT CC AAATTCCAGTGGAAAAT C CAGGCGGAAC TGGACT G 

GAGAAG AAAGCACGGACAGGCAGAAT TG AGAG ACGCCCGGAAAC ACGC AGTGGAGGTG AC TCT GGATCCAG 

AG ACGGCTCACCCG AAGCTCTGCGTT T CT G AT CT GAAAACTGT AACCC AT AGAAAAGCTCCC C AGGAGGT G 

CCTC AC TCTGAG AAGAGAT T TACAAGGAAGAGT GTGGT GGCT TCT C AGAGTTT C CAAGCAGGG AAAC AT T A 

CT GGG AGGTGGACGGAGG AC AC AAT AAAAGGT GG CGCGT GGGAGTGTG CCGGGATG AT GT GG AC AGGAGG A 

AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 

ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 

TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 

AAGGCT TAT T GAGGCCCT AC AT TGAGT ATC CGTC CT AT AATGAGCAAAAT GG AACT CCCATAGTCATCTGC 

CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 

TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 

TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 

GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 

CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 

AACCGT CAGGAATTCCCATCT C ACAGGCTGTGGT GT AGAT T AAGT AGACAAGGAAT GT GAAT AATGCT TAG 

ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 



FIGURE 84 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPEMQMPQYQGRTKLVKDS IAEGRI SLRLENITVLDAGLYGCRI SSQS YYQKAIWELQ 
VSMjGSVPLISITCYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI CPVTQE SEKEAS WQRAS AI PE TSNSE SS S QATTP FLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 

GC T CTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACT GCT GACGAT GCAGAGT TCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 

TACT TCT T T CGT AT GGAGAAAGGAAGTAT AAAATGGAAT TATAAACATCAC CGGC T C TC T GTGAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGT CTCCC TGCAGAGCAAAGCCACAT CAGGAGTGAC TCAGGGGGTGGT CGGGGGAGC TGGAG 

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 

ATGAG AAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 

GAGGC T GAT T C TTG TAGAATTAACAGCCCTCAACGTGAT GAGCT AT GATAACAC TAT GAAT TAT G 

TGCAGAGTGAAAAGCACACAGGCTT TAGAGT CAAAGTATC TCAAACC T GAAT CCACAC TGTGCCC 

T C C C T T T TAT T T T T T T AAC T AAAAG AC AG ACAAAT T C C TA 



FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQS SVTVQEGLCVHVPCS FSYPSHGW I YPGPWHGYWFREG 
7\NTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNI LI PGTLESGCPQNLTCS VPWACEQGTPPMI SWIGTSVSPLDPS TTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 

CCAAGGAAAGTGCAGCTGAGACT CAGAC AAG ATT ACAATGAACCAACT CAGCT TCCTGCTGTTTC 

TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 

TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 

TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 

GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 

GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 

CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 

ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 

AGAAACAGCT CCC TGCTGAGGT ACCGCACGGACAC T GGC T TCC T CCAGACACTGGGACATAATCT 

GT T T GGCAT CTACCAGAAATAT CCAGT GAAATATGGAGAAGGAAAGT GT T GGACTGACAACGGCC 

CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 

GGC CAGCGGGAATT CACT GCGGGAT T TGT TCAGT TCAGGGTAT T TAAT AACGAGAGAGCAGCCAA 

CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

GATACT T TCCAGAGGCCAGTCCCCAGCAGTGTGGAGATT T T TC TGGT TTT GAT TGGAGTGGAT AT 

GGAACT CAT GT T GGT TACAGCAGCAGCCGT GAGATAAC T GAGGCAGCT GT GC T T CTAT TC TATCG 

TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 

CAAC T TACCCAGTAGCTAGAAT GT TAAT GGCAGAAGAGAAAACAATAAAT CAT AT TGAC TCAAGA 

AAAAAA 



FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWT DNGPVI P WYDFGDAQKTASYYS PYGQRE FTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCI GGGGYFPEAS PQQCGDFSGFDWSGYGTHVGYS SSREI TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTGTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACT TATATACAGT TTTAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGAraCAACTGAC 

TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 

GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 

CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGGTGCT 

GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 

CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 

GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 

CCAACACAAT GCCACAGGCCAGGGAAACAT C T CCAT CAGC C TC GTGCCCC CCAGTAAAGCTGTAG 

AGT TCCACCAGGAACAGCAGAT CT TCAT CGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 

GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 

CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

AGGCAGGGAGGGGGGTGGAGACGAGGAGATGGCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CCTTGGTTCTT GCCATCC T GAGGAAAGATAGCAACAGGGAGGGGGAGATT TCAT CAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 

CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 

CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCATC CT TAAGC TAAGACAGGACGATT GTGGTCC T CCCACACT AAGGCCACAGCCCAT C 

CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 

GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 

CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 

TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



FIGURE Q2 

MQLTRCC FVFLVQGSLYLVI CGQDDGPPGSEDPERDDHEGQPRPRVPRKRGH I S PKS RPMANS T L 
LGLIAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGACCAACACAGAAATATG T GCT GAT CT T T GGAGCGT T TGT CTCT GT CTATATCCAAGAA 
ATGTT CCGAT TTGCATAT TATAAACT CTTAAAAAAAGCCAGTGAAGGT TT GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 

GGAGAT T C T C CTCAAT TCTTCCTT TAT T CAGC T T T CAT GACGCT GG TCAT TATC T T GC TGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC TC T GCC T GCTCT GCCAAGACAAGAACT T TCT T CT T T ACAACCAGCGCTCCAGAT 
AACCT CAGGGAACCAGCAC T T CCCAAACCGCAGACTACATCTT TAGAGGAAGCACAAC TGT GCCT 
T T TT CTGAAAATCCCT TT T TCT GGTGGAAT TGAGAAAGAAATAAFiACTAT GCAGATA 
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MTAAVFFGCAFIAFGPALALWFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLI FGAFVS VY I QEMFRFAYYKLLKKASEGLKS INPGE TAPSMRLLAYVS GLGFG IMS GV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site* 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGG TAGCAATC TGAAACATGAGGAGTACGAT TC TACTGT T TT GTCT TCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAAT T T T TGT CACACAACT T GGAGC CCAGGGCACTATCC TAAGCT CAGAGGAAT TGCCACAAAT C 
fafe TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
T AAT CCAGAT GTCCAGGATGGAAGCC T T CCAGCAGGAGGAGCAGGT GT AAAT CC TGCCACCCAGG 
ffi GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
III GCAGGCAT CCAAAGGAGCACACAT G CCAT CGAGGAAGCC ACCACAGAAT CAGC AAATGGAAT T CA 

W G TAA. GCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
J*; AT T GAT TAT AT TATGGAATAGAT T GAGACACAT TGGATAGTCT TAGAAGAAAT TAAT T CT TAAT T 

T AC C T G AAAAT AT T C T T GAAAT T T C AGAAAAT AT G T T C T AT G TAG AG AAT C C CAAC T T T T AAAAA 
H CAATAATTCAATGGATAZmTCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGGATAT 
Ifl TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAJ^AAAAAAAAAAAAAAAAAAAAA^W^AA 

%y 

nj 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMT PGTQTHPLTLGGLNVQQQLHPHVL P I FVTQLGAQGT ILS SEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDYQDGSLPAGGAGVNPATQGTPAGRLPTPSG 

TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAG TACCAGC TACAGGTCACCC TGGAGAT GCAGGAT GGACAT G T CT TGTGGGGTCCAC 
AGCCTGT GCT T GTGCACGTGAAGGAT GAGAAT GACCAGGT GCCCCAT T TC T CT CAAGCCATC TAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
lj. CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
III ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
y CCAGGCCT CAGGCCACCAGGCCACT GCCAC CGT GGAAGT C TCCATCAT AGAGAGCACC TGGGTG T 

%J CCC TAGAGCCTATCCACC T GGCAGAGAATC TCAAAG TCC TATAC CCGCACCACAT GGCCCAGGTA 

|fl CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
ill T GCAGAGGGAAACC TC TACGTGACCAGAGAGC T GGACAGAGAAGCCCAGGCTGAGTACC TGC TC C 

J% AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
% U . ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

V CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
SI ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
a ' CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
fr % CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
J!? T CGAAGT CGCAGTCACAGATAT CAAT GATCACGCCCC TGAGT TCAT CACT T CCCAGAT TGGGCC T 

VI ATAAGCCTCCCTGAGGATGT.GGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
O CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
fjjj GCCT GGAT TGGGAGCCAGACTCT GGGCAT GT TAGACT CAGACT CT GCAAGAACC TCAG T TATGAG 
PI GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
J*; AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
Rf ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
T CTGCCCTGGGGT GGAGGCACCATCACCAT CAC CAGGCATGT CT GCAGAGCC T GGACACCAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAA?^AAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRMjDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T SLDHALERT YQLLVQVKDMGDQAS GHQATAT VE VS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHW S G G DVH YHLE S H P P G P FE VNAE GNL YVT RE LDRE AQAE YL L QVRAQNS HGE D YAAP L E LHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTS GS VTLGVLPLRAGQNI LLLVLAMDLAGAEGG FS S TCE VE VAVT D INDHAPE F I TSQ I G 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI I PVWSHNAQMWQLLWVI VCRC1STVEGQCMRKVGRMKGMPTKLSAVG I LVGTLVAI GI FLI 
L I FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 

AGT CCAAGATTCT TCCCAGGAACACAAACGTAGGAGACCCACGCT CCTGGAAGCACCAGCCTTTA 

TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 

AT CTAGGACCCAGGCAT C TT GC TT T CCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAAT G 

T T CT CCTT AT GT TT GGTC TAC TAT T GCAT T TAGAAGCT GGAACAAAT TCCAATGAGAC TAGCACC 

TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 

TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 

GCAT AGT CACCAACTCT GAGT T CCATACAACC T CCAGTGGGAT CAGCACAGCCACCAAC TC TGAG 

TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 

GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 

CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 

AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 

CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 

CC TCCAGT GGGGCCAGCACAGC CACCAAC TCT GAGTCCAGCACAGT G TCCAGT AGGGCCAGCACT 

GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 

AACGACC T CCAATGGGGCT GGCACAGCCACCAACT C T GAGTCCAGCACGACCT CCAGTGGGGCCA 

GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 

TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 

GGCTAGCACAGCCACCAAC TCT GACTCCAGCACAACC TCCAGT GGGGCCGGCACAGCCAGCAACT 

CTGAGTCCAGCACAGTGTCCAGTGGGAT CAGCACAGTCACCAAT TCTGAGTCCAGCACACCC TCC 

AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 

CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 

CC T C CAGT GGGGTCAGCACAGCCACCAAC TCTGAGTCCAGCACAACC T CCAGT GGGGC TAGCACA 

GCCAC CAAC TCT GAC T CCAGCACAAC CTC CAGT GAGGCCAGCACAGC CACCAAC TCT GAGT C TAG 

CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 

ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 

ATGCACACAAC T TCCCAT AGT GCAT C TACT GCAGTGAGT GAGGCAAAGCC TGGT GGGT CCC TGGT 

GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 

TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 

GT GGAGT CCT AAC T GGTT C TGGAGGAGAC CAGTATCAT C GATAGCCAT GGAGATGAGCGGGAGGA 

ACAGCGGGCCC TGAG CAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 

CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 

AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 

T GCT CAT T T AGC TAAGAAATAAATACATCT CATC TAACACACACGACAAAGAGAAGCT GTGC T T G 

CCCCGGGGT GGGT AT C TAGC TCT GAGATGAACT CAGT TATAGGAGAAAACCT CCAT GC TGGAC TC 

CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAA 

AAJVAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MKMQKGNVLLMFGLLLHLEAJVTNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSG7^NTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTS SEAS TATNSESSTVS SGI STVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGT GT CATGGAAAGGGAT T TAC TT TATACT GACTC TGT T T TGGGGAAGCT T T T T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTT CATT CATAGGAAATGGAAGGATGACAAGAGCCAT T T CGAAGACAT GAT TGAT TAC TT T TGTG 
ATATT CACGAACCAGT T CAACT CCTCATAT TCCCAGAAGGGAC T GAT CT CACAGAAAACAGCAAG 
T CT CGAAG TAAT GCAT TT GCTGAAAAAAAT GGAC T T CAGAAATATGAATAT GT T T TACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAAT CCAC TT TCACG T CCACCGG T ATCCAATAGACACCCT CCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT T T T TATT T TACCGGACAGAGT GT CAT TCCACCT TGCAAGT C TGAAC T CAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT T TGTACAGTCT TGT TAAG TGGTATT T TATAAT CACCATT GT AATC T T T GTGC T GCAAGAGA 
GAAT AT T T GGTGGAC T GGAGAT CAT AGAACT T GCATGT TACCGACT T T T ACACAAACAGCCACAT 
T TAAAT TCAAAGAAAAATGAGTAAGAT TATAAGGT T T GCCATGTGAAAACCTAGAGCATAT T T TG 
GAAAT GT T CT AAACC T TTC TAAGCT CAGAT GCAT T T T TGCATGACTAT GT CGAATATT TCT TACT 
GCCAT CAT TAT TTGT T AAAG AT AT T T T GCACT TAAT T TT GT GGGAAAAAT AT TGC TACAATT TTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GG G C CAGAATAT TAT T AAACAAT CAT C AGGC TTT T AAA 



FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKIGLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNI PQSEKHLLQGDFPRE IH FHVHRYP I DTLPT SKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
YFT GQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLL I YLYS LVKWYFI I T I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 

TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 

TCC ATGG CTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 

CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 

TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 

CACC T CT ACAGAGAT GGGGAAGACT GGGAAT CT AAGCAGATGCCACAGTAT CGAGGGAGAACTGA 

GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 

ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 

CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 

GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 

AG GAT T T G T C T T C AGAC T C C AG AG C AAAT G C AGAT G G G T AC AG C C T G T AT GAT G T GG AG AT C T C C 

ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 

GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 

TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 

AAAT CCAAAGGGAAAATCCAGGCGGAACT GGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 

AGACGCCCGGAAACACGCAGT GGAGGT GACT CT GGAT CCAGAGACGGC TCACCCGAAGC T C T GC G 

T T T CT GATC TGAAAAC TGTAACCCATAGAAAAGC T CC CCAGGAGGT GC CT CACT CTGAGAAGAGA 

T TTACAAGGAAGAGTGTGGTGGC TT C T CAGGGT T TCCAAGCAGGGAGACATTAC T GGGAGGT GGA 

CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 

ATGT GACT T T GT CT CCCAACAAT GGG TAT T GGGT CC T CAGACTGACAACAGAACAT T T GTAT T TC 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 

GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 

T GACATGT CAGT TT GAAGGC TT GT TGAGACCC TATATCCAGCAT GCGATGT ATGACGAGGAAAAG 

GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 

CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 

CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 

TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCATGGCTGCCC TGAAGT GGGGACGGAATAGACTCACATTAGGTT TAGTTTGTGAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGT GAT T CC T GCC TCACAGG T GAAGAT TAAAGAGACAACGAAT GT GAATCATGC TT GCAGGT T 

TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 

TGCT TATTCCACATTAAT TTACT TT TCTCTATACCAAATCACCCATGGAATAGT TATTGAACACC 

T GCT T T GT GAGGCT CAAAGAATAAAGAGGAGGTAGGAT T T T TCAC T GAT T C TATAAGCC CAGCAT 

T ACC T GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACT ACAGGTCCATATCC 

CTCATTAACACAGACACAAAAATTCTAAATAAAATT TTAACAAAT TAAACTAAACAATATATT TA 

AAGAT GATAT ATAAC TAC TCAGTGT GGT T T GTCC CACAAAT GCAGAGT T GG T T TAATAT T TAAAT 

AT C AAC C AG T G TAAT T C AGC AC AT TAATAAAG T AAAAAAG AAAAC C AT AAAAAAAAAAAAAAA 



FIGURE 104 

mFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSMlAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS I HLAEQSHEVE SKVL I GET FFQPS PWRLAS I LLGLLCGALCGWMGM 1 1 VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSP^JNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE IPS 

CCTTC ACAGG ACTCTT C ATT GCTGGT TGGC AATGA T GT ATCGGC C AGATGTGGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT TCAT T ATGTGAG AT AT AATCAAAAGAAGACCT AC AATT ACT AT AGC AC AT TGTC AT TT AC AAC 
TG AC AAACTAT ATGCTGAGT TTGGC AGAGAGGCTT C TAAC AATT TT AC AG AAAT GAGC CAGAGACTTG AAT 
CAAT GGTGAAAAAT GC AT TT T AT AAAT CTCCATT AAGGGAAG AATTTGT C AAGT CT CAGGTT ATC AAGTT C 
AGTC AACAGAAGCAT GGAGT GT TGGC T CAT AT GCTGTT GATT TGT AGATT T C ACTCT ACT GAGGATCC TG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATC CTC ACTC AGT T AAAATT AAAAAAATCAAC AAGACAGAAACAGAC AGCT ATCT AAACC AT TGCTGC GG A 
AC AC GAAGAAGT AAAACT CT AGGTCAGAGT CT CAG GAT CGT T GGTGGGAC AGAAGT AGAAGAGG GT GAAT G 

U 

jr*! GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
ff\ TTGT GAGTGC TGCT C ACT GT TTTACAAC AT AT AAGAAC CCTGCC AGAT GGACTG CT TCCT TTGG AGT AAC A 

III ATAAAACCTT CG AAAATGAAACGGGGTCT C C GGAGAAT AATT GT C CAT GAAAAATACAAACACCCAT C AC A 

|f| TGACTATGAT AT TTCT CT TGCAGAGC TT TCT AGC CCTGT TCCCT AC ACAAAT GC AG TACAT AGAGTTT GT C 

%B TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
"'H GATGGT T ACAGT CAAAAT CATCT T CG ACAAGC AC AG GT GACT CT C ATAGACGCT ACAACT TGC AAT GAACC 

TCAAGC TT AC AATGACGC CAT AACT CC T AG AATGT T AT GT GCTGGCTCCT TAGAAGGAAAAAC AGATGCAT 
T & GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 

I n 

%l AGCT GGGGAG AT GAAT GT GCGAAACCCAAC AAGC CT GGT GTT T AT ACT AGAGT T ACGGCC TT GCGGGACTG 

Tl GATT ACTTCAAAAACT GGT AT C TAAG AGAC AAAAGC CT C ATGGAAC AG AT AACATT TTTT TT TGTTTTTTG 

f*\ GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
f|| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACTCT GT CATCTGTGAGCAAT AG TTGAAACT TT AT GT AC AT AG AGAAAT AG AT AATACAAT AT TAC 
ATT ACAGCCT GT ATT CAT T TGT TCTCT AGAAGTT T T GT CAGAAT TT TG AC TT GT TGAC AT AAAT TT GT AAT 
GC AT AT AT AC AATT T GAAGC ACTCCT TT TC TTCAGT T CCT CAGC TCCT CTCAT T TCAGCAAAT ATCC AT T T 
TC AAGGTGCAGAAC AAGG AGT G AAAGAAAAT AT AAGAAGAAAAAAATCCC CT AC AT TT T ATT GGC ACAGAA 
AAGT AT TAGGTGTT T T TCT T AGTG G AAT AT T AGAAATGAT CAT AT T CATT AT GAAAGGTC AAGC AAAG AC A 
GC AGAAT ACC AATC AC TTCATC AT T T AGGAAGT ATGGGAACT AAGT T AAGGAAGT C CAGAAAGAAGCC AAG 
AT AT AT CCT TAT T T TCATTTCCAAAC AACT ACT ATG AT AAAT GTGAAGAAGATT CT GT TT TTT T GT G ACCT 
AT AAT AAT T ATACAAACT T CAT GC AAT GT ACT T GT T CT AAGC AAAT T AAAGCAAAT AT T T AT TT AACATT G 
T T ACT GAGGAT GTCAACAT ATAAC AAT AAAAT AT AAAT CACCCA 
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mYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 

AE FGRE ASNNFTEMS QRLE SMVKNAFYKS PLREEFVKS QVIKFS QQKHGVLAHMLL I CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGS HRCGATL INATWLVSAAHC FT T YKNPARWTAS FGVT I KPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
. CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACA ATG GCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T CT CACT T TAACTGCAGAAACGAGGGTAGAGGAAGCAGT CATT T T GACT TACT T TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACT GGCT TGTGGCGT TTGGACATATGAACAGGAACT TATGGT TCCAGTACAATGGTCA 
GAT AT GG T CAC T T T GAAAGCCAGGAT GACAAAT TATGGAT T ACCTAGATAT CGGTGGCT TACTCA 
i s =. TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
L :; AAAT GACAGAGATGGAC TGGCCCCCAGAT TCCTGCTGTGT TAGAGAAT T CCCAGGAT GT TCCAAA 

CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
Q T TT GAGAGGAACCAAACAACTGCAGGT GCT GAGGTT TC TGGGAATCT C CAT T GGGG T GACACAAA 

pi TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
III ACAGACCAAAT GAT GT CC T TGAAGAATGACAAC T CT CAGCACCT GT CATGT CCCTCAGTAGAACT 

ii\ GT TGAAACCAAGCC TGTCAAGAATCT TTGAACACACATCCATGGCAAACAGCTTTAATACACACT 

HI TTGAGATGGAGGAGTT ATAA AAAGAAATGTCACAGAAGAAAACCACAAACT TGTT TTAT TGGACT 

TGTGAATTT TTGAGTACATACTATGT GTT TCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
\| TAACACCTAAGCATATACTATT CTATGCT TTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 

s ACCACCT GGACAATAAT T GATGCCC T TAAAATGC TGAAGACAGAT GT CATAC CCAC TG TGTAGCC 

TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
TCCAT GCAAAC GAGT CACATAT GGT GGGACT GGAGC CATAGTAAAGGT TGAT TTAC TTC TACCAA 
CTAGTATATAAAGTACTAATTAAAT GCTAACATAGGAAGT TAGAAAATACTAATAACT TTTATTA 
O C TCAGCGAT CT AT TCT TC T GATGCT AAAT AAAT TATATAT CAGAAAAC T T T CAATATT GGTGACT 

b| ACCTAAAT GT GAT TTTTGCTGGT TAC TAAAATAT TC T T AC CAC T TAAAAGAGCAAGC TAACACAT 

p| TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 

T T CAGT TCTGATAATG TTAAGAATAACCAT TATGAAAAGGAAAAT T T GTCC TGTATAGCAT CAT T 
111 ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

T G T TAT T T AAAT AC T T AAC CAC T AAT T T T GAAAAT TAC C AG T G T G AT AC AT AGG AAT CAT TAT T C 
AGAAT GTAG TC T GGT C T T TAGGAAGTAT TAATAAGAAAAT T TGCACATAACT TAGT TGATT CAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TAT CTTTGCCTTCTC C AAAC AAG AAGC AAT AG T C T C C AAG T C AAT AT AAAT T C TAC AG AAAA 
TAGTGTTCT TT TTCTCCAGAAAAATGCTTGT GAGAATCAT TAAAACATGTGACAAT TTAGAGATT 
CTTTGTTTTAT TTCACTGATTAATATACT GTGGCAAAT TACACAGATTATTAAATT TT TT TACAA 
GAG TAT AGT AT AT T TAT TT GAAATGGGAAAAGT GCAT T T TACTGTAT T T T GT GTAT T T T GT T TAT 
T TC T CAGAATATGGAAAGAAAAT TAAAAT GT GTCAATAAATATT T T CT AGAGAGTAA 
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MZ^EDSVKCLRCLLYALNLLFWM 
VMIAVCCFLIIVGMLGYCGTVKRN^ 

LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYS-FLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TCT C GT GAGAGCC T AG AGGCCT T AAAAAAAAAAGT GCT T GAAAGAGAAG GGGAC AAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 

CCT GGAGGAAC ACT GACC AC C AGTT GGAT GAGT CT C AAGGT CC T CCT CT AT GT GAC AACC AT GT GAAT GGG GAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGCTCAGATACCAGCGAGTGCAC ATGC GCTCCAGGAACT GTGCTAGGCCCTGACAGGCAGACAT GCTTTG 
AT G AAAAT GAAT GT GAGC AAAAC AACGGT GGCT GC AGT GAGAT CT GT GT GAACC T C AAAAACT CC T AC CGCT GT 
GAGTGT GGGGT TGGCCGTGT GCT AAGAAGT GATGGCAAGACTT GTGAAGACGTTGAAGGAT GCC ACAAT AAC AA 
TGGT GGCT GCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGT GAAT GTCCCCGGGGCCTGGT GCT GT 
CTGAGGATAAC C AC AC T T GC CAAGT CCCTGT GTTGTGCAAATC AAAT GCCAT TGAAGT GAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCT C AAGAC AT GT GGT AC AGT G GT C GAT GT G GT GAAT GAC AAG ATT GT GGCCAGC AACC T C GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T GC GAGT T TCC AC GCC T GT ACACC AT T T CT GAAGGAT AC GTT CC C AACC T T CGAAAC T CC C C ACT GGAAAT 
CAT GAGC C GAAAT CAT GGG ATCT TCC C AT T C ACTC T GGAGAT C T T C AAGGAC AAT GAGTT T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTT GGT GGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCT GAAAT ACTACCTC AT 
C CGG GAT GGCT GTGT TT C A GAT GAC T C G GT AAAGC AGT ACAC AT CCC GGGAT C AC C T AGC AAAGC ACT T CC AGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGAC TAG TTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
T GGGTT C AGACT T C ACAC T GT GAGTT CAGAC T CCC AGC AC C AACT CAC TCT G ATT C T GGT CC AT T C AGT GGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
C CC AGAAAGAC ACT C ACC CC AT TTCCCTCATTTCTTTCC T AC AC T T AAAT AC CTC GT GT AT GGT GC AAT CAGAC 
CAC AAAAT C AG AAGC TGGGT AT AAT AT T T CAAGT T AC AAACC CTAGAAAAAT T AAACAGT T AC T GAAAT TAT GA 
CT T AAAT AC CC AAT GAC T CCT T AAAT AT GT AAAT TAT AGTT AT ACCT T GAAAT TT C AATT C AAAT GC AGACT AA 
T TAT AGG GAAT TT GG AAGT GT AT CAAT AAAAC AGT AT AT AAT T T T 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGH FYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHWILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CC TGCT GCCTTGGGGTGACAATCT CAGCT C C AGGCT AC AGGGAGACCG GG AGGAT C AC AGAGCC AGC ATGT 
T ACAGG AT CCTGAC AGT GAT C AAC CT CT GAAC AGCCTCG ATGT C AAACCC CT GC GC AAACCC CGTAT CCCC 
ATGG AG ACCTTC AG AAAGGT GGGG AT CCCC AT C ATC AT AGCACT ACTG AGCC TGGCGAGT AT CATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GT GTGACGGAGAGCTGGACT GT CCCT TGGGGGAGGACGAGGAGC ACT GT G TC AAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GC TACAGCAGAGCT GT GG AGATTGGCCCAGACCAGG AT C TGGAT GTTGTT GAAATC AC AGAAAAC AGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGG AAGAGC CTGAAGACCCCCCGTGTGGTGGGT GGGG AGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AGC AT CCAGT ACGACAAACAGC AC GT CT GT GGAGGGAGC ATC CTGGAC CCCC ACTGGGTCCT CACGGC A 
GC CC ACTGCT T C AGGAAAC ATACCGATGTGTT CAAC TGGAAGGT GCGGGC AGGC TCAGAC AAACT GGG CAG 
CTTCCC ATCC CT GGCT GTGGCC AAGAT CAT CATC AT TGAATTCAACCCCATGT ACCCCAAAG AC AAT G ACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
G ATGTCTGAC AT ACT GCT GC AGGCGTCAGT CCAGGTC AT T GACAGC AC AC GGTGCAAT GC AG ACGATGCGT 
ACC AGGGGGAAGTCACCGAGAAGATGAT GT GTGCAGGC AT CCCG GAAGGGGG T GTGGACACCTGCCAGGGT 
GAC AGT GGTG GGCCCCTGAT GT ACCAATCT GACC AGTGGC AT GT GGTGGGC AT CGT T AGC TGGGGC TATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCT TGGAGC AGC AAAGGGCCTCAAT T C CT GT AAGAGACC CT CGCAGCCC AGAGGCGCCC AGAGGAAGT C A 
GCAGCCCT AGCT CGGC CAC ACT TGGT GC TCCC AGC ATCCC AGGGAG AG ACAC AGCC C ACT GAACAAGGT CT 
C AGGGGTATT GCTAAGCCAAGAAGGAACTT TCCC AC AC TACT GAAT GGAAGC AGGCT GTCTT GTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC TAGAGC AAGAAACCAGTT GTAAT ATAAAAT GCAC TGCC CT ACT GTTGGTAT GAC TACCGT TACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVF1MKVRAGSDKL 
GS FPSLAVAKI 1 1 1 E FNPMYPKDNDI ALMKLQFPLT FS GTVRP I CLP FFDEEL TPAT PLWI I GWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 

TGAACTGCTAGGAT TC T GAC TATGC T GT GGT GGC TAGT GC TCCTAC TCCTACC TACATTAAAAT C 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
N CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
ffl GAAGCAGT GGGT GAGAC AT C AC GCT GCCCGCCCAT C TAAC CT T T T C AT GT CCTGCACAT CACC T G 

|;J ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
V! CAGAAGGGGTC TGC T TAGACCACCT GGTT TAT GTGACAGGACT T GCATT CT CCT GGAACATGAGG 

?^ GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAAC T TG TGCCAAAT TAT GGGTCAGAAAAGAT G 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
#1 CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
y| C CACGTACCAGCTGC TGAAGGGCAAC TGCAGGCCGATGC T C T CATCAGCCAGGCAGCAGCCAAAA 

Q T C T GCGAT CAC CAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGT GACCAT TTC TCC TCCC 

|Jf CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
jSj GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C TCAAT T TAAAT CATGT TCTAGTAAT TGGAGCTGTCCCCAAGACCAAAGGAGC TAGAGC TTGGT T 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TACT CGGGAGGC TGAGACAGGAGAAT TAC TT GAACCTGGGAGGTGAAGGAGGCT GAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTC TGTCTCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPIAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 

C AG C AG TGGTCTCT C AG T C C T C T CAAAG C AAG GAAAGAG T AC TGTGTGCT GAG AGAC C ATGG C AA 
AGAATCCTCCAGAGAATTGTGAAGACT GT CACAT TC TAAATGCAGAAGC T TT TAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T CTACAGC AATGGAGAGAAGAAGAAGAT T TACATGGAAAT TGATCC T GT GACCAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAAC T T TC T T TGAACAGTCAGTG 
AT T T GGGT CCCAGCAGAAAAGCCTAT TGAAAAC CGAGAT T TTCT TAAAAAT T CCAAAATTCT GGA 
GAT T T GT GATAACGTGAC CATGT AT TGGATCAATC CCACTCTAATAT CAGT T TC TGAGTTACAAG 
ACT TT GAGGAGGAGGGAGAAGATCT T CACT T T CC TGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAG T GGGTGGT CCC TC AAGTGAAAGTAGAGAAGACCCG T CACGCCAGACAAGCAAGTGAGGA 
AGAACT TCCAATAAATGACTATACTGAAAATGGAATAGAATTT GATCCCAT GCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT C AC C AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HT FYSNGEKKK I YME I DPVTRTE I FRS GNGT DE TLEVHD FKNG YTG I YFVGLQKC F I KTQ I KVI P 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 

TAT TTC ACC AT C CT GGGACT TCC AGC CATGCT GC AGGC AGT GCGAGCCCT GATGATCGT AGGCAT CGT CCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TC TGTGTT T GCC AACATGCT GGT G AC T AAC TT CTGGAT GTCC AC AGCT AACATGT ACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CTT CCAAGC ACG ACTATGTGTAATGC TCT AAG ACCT CT CAGC ACGGGCGG AAGAAACTC C CGG AGAGC TC A 
CCCAAAAAAC AAGG AG AT CC CATCTAGATT TCTTCT TGCTTT TGACTC AC AGCT GGAAGT T AGAAAAG CC T 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTT AT TT AT GAAT T AGAGGCT AT AGCT CACATT T TCAAT CCT CTATTT CTT T TTT T AAAT ATAAC T 
TTC T ACTCTGAT GAGAGAAT GTGGT T TT AATCTCTCTCTC AC AT TTTG AT GAT T TAGAC AGACTCCCC CTC 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
AC ACTT ACT GAAGAAGAAGC AAT AAGAGAAAGAT AT TT GT AATCT CTCCAGCCC AT GATCTCGGT T TT CTT 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
T TGACATCT T CT TAT T AC AGCAAC ACC ATT CT AGGAGT TT CCT GAGCT CT CCACTGGAGT CCTCTT TC TGT 
CGCGGGTCAGAAATT GTC CCTAGAT G AATG AGAAAATT AT TT T TTT T AAT T T AAGT CCT AAAT ATAGT T AA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAAT GAAAAAATAAT TGCTTT GAC ATT GT CT AT AT GGT ACT TT GTAAAGTCAT GCT T AAGT AC AAAT TCC 
AT GAAAAGCT CACACCTGT AAT CCT AGCAC TT TGGG AGGCTG AGGAGG AAGG AT CACT TGAGCCCAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGT GGC ATACACCTGT AGTCCCAGC AT TC CGGGAGGCT GAGGTGGGAGGAT CACT TGAGCCC AGG GAGGT 
T GGGGC TGCAGT GAGCCAT GAT C ACACC ACT GC ACT CCAG CC AGGT GACAT AGC GAGATC CTGT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



FIGURE 118 



MS TT TCQ WAFLLS I LGIAGC I AATGMDMWS TQDLYDNP VT S VFQ YEGLWRS C VRQS S G FTECRP 

YFTILGLPAMLQAVRALMIVGIVLGAIGLLV^ 

CAIAGVSVFANMLVTNFWMSTAJSIM^ 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE no 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 
T CTAC TCCAGAAGTCAGTAT GTGTAGTT GTGTATGT TTT TT TAACTT TACTATAAAGCCATGCAAAT G 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
T TAAACCCAAT G C T T T GAT T G TT C T AGAAAGT ATAGT AAT TTGTTTTC T AAGGT G G T TCAAGCAT CT A 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCATTAT GTACATAGATGAGTGTAACATTTATATCTCACATAGAGACAT GCT TATAT GGT 
T TTAT TTAAAATGAAAT GCCAGTCCATTACACT GAATAAATAGAACTCAACTATTGCTT T TCAGGGAA 
ATCAT GGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAAT GTCCTCCA 
TTTATAAT GAAGATTAAAAT GAAGGCT TT AATCAGCATT GTAAAGGAAATT GAATGGCT T TCT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT T TTTGTTT TT GTATTT GAAGAAGAATGAT GCATT TTGACAAGAAAT CATATAT GTATGGAT 
ATATTTTAATAAGTAT T TGAGTACAGACTTT GAGGT TTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATATTAAT AAATTGTACATT TTTCTAATT 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGW 
IYDSLLALSPDLQAARGLMCAASVM 

MWL I PVSWVANAI I RDFYNS I VNVAQKRELGEALYLGWTTALVL I VGGALFCCVFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

I Transmembrane domains: 

\ . . 

* ammo acids 82-101, 118-145, 164-188 

# 

'i. 

I 

J .. . 



I 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAAT GCAT GC TG TCAGCGT TGGTAT T T CACAT T CAAT GGAGCT GAAT GTT CAGGACCTC TTCCC 
AT TGAAGC TATAATT TATT T GGAC CAAGGAAGCCCTGAAATGAAT T CAACAAT TAATATTCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GGCACT TGT T CAGAT TACCCAAAAGGAGATGC TTCTAC T GGAT GGAAT TCAGTT TC TCGCATC 
AT TATT GAAGAAC TACCAAA ATAAA TGC T T T AAT T TT CAT T TGC T ACCTCTT TT T T TAT TATGCC 
TTGGAATGGTTCACTTAAATGACAT TT TAAATAAGTT TATGTATACAT CTGAATGAAAAGCAAAG 
C T AAATAT GTT T ACAGAC CAAAG TGT GAT T T CAC AC T G T T T T T AAAT C TAG CAT TAT T CAT T T T G 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT ATAAAAGC TACCAAT CT TT G TACAATT TGTAAATGTTAAGAATT T T T T TTATATC TGT 
T AAAT AAAAAT TAT T T CCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSMRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCC TGGAAGAATACATCATGT T TT TCGATAAGAAGAAAT T GT AGGATCCAGT TTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
T CT T C C TGGG TGTCAGCGAGCCCT GAC T CACTACAGT GCAGCT GACAGGGGCT GT CATGCAAC TG 
GCCCC TAAGC CAAAGCAAAAGAC CTAAGGAC GACC TT TGAACAATACAAAGG ATG GGT T TCAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
'* ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
| C T T GACC ATAACCATATCAGCAATAT T GACGAAAATGC T TT TAAT GGAATACGCAGACTCAAAGA 

J GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
I TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
I AGACT GCCGCAACCTGGAAC TT T T GGACCT GGGATATAACCGGAT C CGAAGT T TAGCCAGGAAT G 

TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCC TGGACCT GGAGC TCC T TACAAAGGC T T GAT T TATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGT C TAAGGGAGAATACAAT TAT C T GTGCCAGTCCCAAAGAGC T GCAAGGAGTAAAT GT G 
AT CGAT GCAGTGAAGAACTACAGCATC TGT GGCAAAAGTACTACAGAGAGGT T TGATC TGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
I TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
I TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
| CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAG T CCC TAAAGCAAAT GAC T CCCAGCACCCAGGAATT T TATG TA 
GAT TATAAACC CACCAACACGGAGAC CAGCGAGATGC TGC TGAAT GGGACGGGACCCT GCACCTA 
TAACAAATCGGGC TCCAGGGAGT GTGAGGTATGAACCAT TGT GATAAAAAGAGCT C TTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GG TCAT T T TCC TC TCATACATAAT CAACCCAT TGAAAT T TAAAT ACCACAAT CAATGTGAAGCTT 
GAACT CCGGTT TAATATAATACCTAT T GTATAAGACCC T TTACT GAT T C CAT TAATGTCGCAT T T 
GT TT TAAGATAAAAC T TCT T TCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFOTIRLLSGSAVALVIAPTVLLTMLSSMRGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNS LQKLKYNQFKGLNQLT WL YLDHNH I SNI DENAFNGI RRLKEL I LS SNR I S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVI 

SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVWIDAVKNYSICGKSTTERFDl^ARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12* 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCT TTCTAAAGCT TTGGCAAGGAGTGACACCCGCCATT TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AGCCAAT C CAAC TGACCTAGT GAAGGT TCAGAT GCAAATGGAAGGAAAAAGGAAAC T GGAAGG 
AAAACCAT T GC GATT TCGTGG TGTACATCATGCAT T TGCAAAAAT CTTAGCT GAAGGAGGAATAC 
GAGGGCT T TGGGCAGGC TGGGTACCCAATATACAAAGAGCAGCACTGG T GAATAT GGGAGAT T TA 
AC C AC T TAT GATACAG T GAAACACT AC T T G GTAT T GAAT AC ACC AC T T GAG GACAAT AT CAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAAT GAAT CAACCAC GAGATAAACAAGGAAGGGGACT T T T GTATAAAT CAT CG 
AC TGACTGC T TGAT TCAGGC TGT T CAAGGT GAAGGAT TCATGAGT CT ATAT AAAGGC T T T T TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAEIATFPLDLTKTRLQMQGEMIARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGGI RGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLL YKS S TDCL I QAVQGEGFMSLYKGFLPS WLRMT PWSMVFWLT YEKIREMS GVS P F 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCC T C TCCATGT TCT CAGC TATCCGT TCT CAGCACAGCGGT GTAGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGT GGACGATGGGCACAT CAT TAACAT CAATAGCATGT CTGGCCACCGAGTGT TA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVA^ 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIAKPOT^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWET Q FAFKLHDKDPEKAAAT YE QMKCLKPE DVAEAVI YVL ST PAH I Q I GD I QMRP TEQVT 

Important features of the protein: 
Signal peptide: 
amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 

CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 

ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 

CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 

GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 

CCGT T GAGT GC T TGAAATGAGGAACT GAGAAAAT TAAT T T C TCAT GTAT T T T TC T CAT T TAT T TA 

T T AAT T T T T AAC T GATAGT TGT ACAT AT T T GGGGGT AC AT GT GATAT T TGGATACATG TATACAA 

TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT 

T TAGACAGAGTC T CAC T C TGT C AC CCAGGC T GGAGTGCAG T GGT GCCAT C T CAGC T TAC TGC AAC 

CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

GCACCAC7\ATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 

CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 

GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 

ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG . 

GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lai 



T T CT GAAGT AACGGAAGCT ACCT TGT AT AAAG ACCTCAAC AC TGCT GAC CATGATC AGCGCAGC CT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
■ T T GAAAAACT TGCT GAAAGT AG AAAG AATATACCT ATACC AC AAC AGT T TAG AT GAAT TT CCT ACC AACCT 
CC CAAAGT AT GTAAAAGAGT TACATT T G CAAG AAAATAAC AT AAGG ACT AT CACTT AT GATTC ACT TT CAA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CC AGGACT AT AGAAGAACTACGCT TGGATG AT AATCGC AT ATCCACT ATT TC AT CACC AT CT CT TC AAG 
GT CT CACT AGTCT AAAACGCCT GGT T CT AG AT GG AAACCT GT TGAACAATCATGGTTT AGGT GACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
T CCAGGCACAAACCT GAGGAAGCT TT AT CT TCAAGAT AAC CACATC AAT CGGGT GCCC CC AAATGCTT TTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATGAT TTGGAC AATATAAC ACAACTGATT CTTCGC AAC AAT CCCT GGT ATT GCGGGT GC AAGATGAAAT G 
GGTACGTGACT GGT TACAATCACT AC CT GT GAAGGT C AAC GT GCGT GGGCTCAT GTGCCAAGC CCC AGAAA 
AGGT TCGT GGGATGGCTATT AAGGAT CT CAATGCAGAACT GT T TGATTGTAAGGACAGTGGGATTGT AAG C 
AC CATT CAGATAACCACT GC AATACCC AAC AC AGTGTATC CT GCCC AAGGAC AGT GGCCAGCTC CAGT GAC 
CAAAC AGCC AGAT AT T AAGAACCCC AAGCT CACT AAGGAT CAAC AAACCACAGGGAGT CC CTCAAG AAAAA 
C AAT T AC AAT TACTGT GAAGTCTGT C ACCT CTGATACCAT TC AT AT CT CTTGGAAACTTGCT CT ACCT AT G 
AC T GCT TT GAGACT CAGC TGGCTT AAACTGGGCC AT AGCCCG GCAT TT GG ATCT AT AACAGAAAC AATTGT 
AAC AGGGGAACGCAGT GAGT ACTT GGTC AC AGCCCT GG AGCCT GAT TC ACCCT ATAAAGTAT GC AT GGTTC 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGC AATGAACC C ATC TCGAAGGAGGAGTT TGT AATAC AC ACCAT ATT TCCT CC TAATGGAATGAATC TGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCAC AC TCATGATGCT GAAGGACT C ACAGC AG ACTT GT GT TT TGGGTT T T T T AAACCT AAGGGAGGTGATG 
GT 
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FIGURE 132 

M I S AAWS I FL I G TK I GL FLQVAPLSVMAKS C P S VCRCDAG F I YCNDRFL TSIPTGIPE DAT T L YL 
QMNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFS YLRQLYRLDMSNNNLSNLPQG I FDDLDNI TQL I LRNNPW YCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAEL FDCKDS GIVS T I QI T TAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGS PSRKT I T I TVKSVTS DT I HI SWKLALPMTALRLS WLKLGHS PAFGS I TET 
fck IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
O PYKNPNLPLAAI IGGAVAIiVTIALLALVCWYVHRNGS LFSRNCAYSKGRRRKDDYAEAGTKKDNS 

%K? ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

If! 

w 

5 II Important features of the protein: 

%? i 

uj Signal peptide: 

^1 amino acids 1-28 

Transmembrane domain: 

o 

*«s amino acids 531-552 

f% N-glycosylation sites. 

ill amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

W Tyrosine kinase phosphorylation site. 

m 

amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 123 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTC ATG TCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGAC TAT AT CAACCT C T T C TGCGCCAACAT GAT GGAC TTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGC TTCGGGGAGCC T GATGC T GAAGAT GAAGAAT TAT C TAAAGCTAT TCAA 
TATCAGCAGCATT TT TCGAGGAGAGTGAAGAGGCGAGAAAAACAATT TCCAGATTC TCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T TAG TCCTCATCCC T TAGATCCT GGAGGGCACGGATCACATCCT GGGAAGAAGGCAT C TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGG GAGGGACCCAGGTGTGAACGGAT GAATAAAG T TCAACT GCAAC TGAAAAAAAAAAA 
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FIGURE 134 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAE SKAT IADL IL SALE RAT VFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLS LRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAISnyiMDLNRRAEAIGYAYPTRDIFKENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 
amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 13 5 

GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAAT TACAT CAAAACATCAGAGGT TGTCAGAC T GCC CTAT CCTC TCCAAATGAAATC T T CAGGT 
|« CCACCT TCT TAC T T TAT TAAAAGGGAATCGT GGGGCT GGACAGAC TT T CTAAT GAACCCAAT GGT 

y TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 

CT GACATGAGACGGGAAATGGAGCAGT CAAT GAATATGC T GAATT CCAACCATGAGTTGCC TGAT 

111 

; , f GT TT CT GAGT TCATGACAAGACT CT T C T CT T CAAAATCAT C TGGCAAAT CTAGCAGCGGCAGCAG 

|f| TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
\tl AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

1, AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

4,1 

1 » GAGGTCAATAT TGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTAATAAAT TAT 

Q atgaactactatacattatgtatattaattaaaacatcttaatcc^^ 

|y AAAAAAAAAAAAAAA 

HI 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHD I PS GSYWEWS PAYRFDPVRVD ITS KGKMRARYVNY IKTSE 
VTOLPYPLQMKSSGPPSYFIKRESWGWTDFLMNP3MVMMMVLPLLIFVLLPKVW 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

l=| Transmembrane domai n : 

isj amino acids 161-182 



™# N-glycosylation site. 

amino acids 184-187 

h 

HI Glycosaminoglycan attachment sites. 

Q amino acids 37-4 0, 23 6-239 

i .. s 
W 
$*% 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCT GT GT GATCGCCACAAACC TTCAGGAAATACGAAATGGAT T T TC T GAGATACGGGGCAG T G 
TGCAAGCCAAAGATGGAAACATT GACATCAGAATCTTAAGGAGGACTGAGTCT TTGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CGAT CAAGAAGGACC T CCGGCTC T CT CATGC CCACATGACATGCCAT T GTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGAT TC T GAGT CACT T T GAAAAGC TGGAACCT CAGGCAGCAGT TGTGAAGGC 
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
C T GCTAAGAATAT TCGAGGT CAAGAGC TCCAGTC TTCAATAC CT GCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
T GTAATATCT TTCTGCTATTGGATATAT TTATTAGTTAATATAT TTAT TTATT TTTTGCTATT TA 
AT G TAT T TAT T T T T T T AC T T GG AC AT G AAAC T T T AAAAAAAT T C AC AGAT TAT AT T T AT AACC T G 
ACTAGAGCAGGTGATGTAT TTTTATACAGTAAAAAAAAAAAACCT TGTAAAT TCTAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATT TGAAAT TGAACCAATGACTACT TAGGATGGGT TGTGGAATAAGT TT TGATGTGGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAAT CCTACACGGCCAGCAT GT AT T T C TACAAATAAAGT T T TCT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 1*8 

MRQFPKTS FDI SPEMS FS I YSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPE I F 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
S LANS FLT I KKDLRLSHAHMTCHCGEEAMKKYS QI LS HFEKLE PQAAWKALGELD I LLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 

CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 

TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 

TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 

CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC. 

AC CACC T CT C TTCAGTAAAG TT GT TAT T GT T C TGATAGAT GCCT TGAGAGAT GAT TTTGTGTTTG 

GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 

TT TGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCC TCGAAT CAAGGCAT TGATGACGGGGAG 

CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 

TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTT TATGGAGATGAAACCTGGGTTAAAT TA 

T T CCCA^AG CATT TT G T GGAATATGAT GGAACAACCTCAT T T T T CG TGT CAGAT TACACAGAGGT 

GGATAATAATGTCACGAGGCATT TGGATAAAGTAT TAAAAAGAGGAGATTGGGACATAT TAATCC 

TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 

AAGCT GAGCGAGATGGACAGCGT GCT GAT GAAGATCCACACC T CACT GCAGT CGAAGGAGAGAGA 

GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 

GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 

CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 

ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 

CAAT GAGAGAGCAGTTGAGATT T T TACAT T T GAATACAGT GCAGC TTAGTAAACTGTT GCAAGAG 

AATGT GCCGTCATATGAAAAAGAT CC T GGGT T T GAGCAGT T TAAAAT G T CAGAAAGAT T GCAT GG 

GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 

TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 

CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 

CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 

GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 

AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 

CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 

GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 

T CGGACACC TT CAT TCGT TAGC T GGGGAGTGGT GGT GAGGCAGT GAAGAAGAGGCGGATGGT CAC 

AC T CAGAT CCACAGAGCCCAGGATCAAGGGACC CAC T GCAG TGGCAGCAGGACT GT TGGGCCCCC 

ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 

CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 

CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 

CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 1AO 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRS SARAEHGAEPPAPEPSAGAS SNWTTLPPPLF 

SKWIVIjIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKAI^ 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 

TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 

NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
T CCCTATAGAAAACAAC TGCCAGCACCT TAAGACCAC TCACACCT TCAGAGT GAAGAACT TAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAG T TCCAGATAAAAAC TACATACGCCCAGAGATC T TCT T TGCATT AGCCT CATCCT TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAG T CAT CCAT CCC T TCAGCTGAAGAAGGAGAAAC T GATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGG GT GACAGATAAATT TGAGAACAGGAAACACAT TGAAT T T TCAT T TCAACCAGTT TGCAA 
AGCTGAAAT GAGCCCCAGT GAGGT CAGCGAT TAGGAAACTGCCC CAT T GAACGCC T TCCTCGCTA 
AT T TGAAC TAAT TGTATAAAAACACCAAAC CTGC TCAC T 
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FIGURE 14.2 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS SLS S AS AEKGS P I LLGVS KGE FCL YCDKDKGQSHP S LQLKKEKLMKLAAQKE SARRP PI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-rayristoylation site. 

amino acids 50-55, 87-92 

Interleukin- 1 

amino acids 37-182 
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FIGURE 

C TAGAGAGTATAGGGCAGAAGGATGGCAGATGAGT GAC TCCACAT CCAGAGCTGCC TCCC TT TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT TTGC TGTCCACACAT TCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTT GAATTCC TGGAAGGAGCAGGTGGAGTCCAAGAC TGTATTCTCAATGGAGCTACTGCT 
GGGGAGMCTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
AGCT GAACAATAC TT TCAC CTGC T TCT T CACCAT CAGCACCAGGCCCTGGATGAC T CAGT TCAGC 
CTCCTGvi\ACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGGATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
AC AC AT C T T GAGC C T AAT CAT G TAG T G TAG AT CAT T AAAC AT C AGC AT T T T AAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



APP_ID=10063597 



Page 293 



FIGURE iaa 

MLGLPWKGGLS WALLLLLLGSQ I LLI YAWHFHEQRDCDEHNVMARYLPAT VE FAVHT FNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CTGT GCAGC TC GAGGC TCCAGAGGCACAC T CCAGAGAGAGCCAAGGTT C TGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAAT CAAGTGGAACCGGAAGGCCC T GCC CAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGT TCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTC TGG 
|_S, CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
O TCGGGTCACCATGCACCAGCCAGTGCTCCTGTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
j3 AAGC T TGCCAGGAGGC TGGCAGTACAGAGCGCAGCAGCGAGCAAATCC T GGCAAGT GACCCAGCT 

f * CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
|f| TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
*?| CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
^| GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
^ GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
7% ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
|y CTCTAATGAAAT TGTGAAAGCTCCATGTT TAGAAATAAATGAAAACACCTGA 

ru 
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FIGURE 146 

MRKHLSWWWIATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQE LCSLKHCE FWLERGAGLRVTMHQPVLLCLLAL I WLMVK 



1*$ 

5-s s 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 



Pi N-myristoylation sites. 



amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCT GAT GAT T TATAGACTC AAAGAAAACT CATGT TCAGAAGCT C TCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 




ill 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



I4. N-myristoylation site* 

Cl amino acids 62-68 

O 

m 

w 

in 

m 
H 



Q 

m 
o 
w 

o 
ill 
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FIGURE lao 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACGCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGTGCAGAAAGACCT T TGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATATCCT CCAGAAAACAAGCCAGGT CAGAGCAACTAT TCTT T 
T GT TGATAACT TGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAAT CTA 
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATC TCAAA.GGAAGCCAACAA 
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
T GACT CCAATGGCAGCAATT CAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GATTACTAT CATGAAAACACTGATT GACT TTGTGAAGAT GATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAA.GAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GAT GATAACTCCAAC CCAGGAGGAAAGACAGAT GAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAATAT TGAAT GGT TGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCC^GGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCT GAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKL I DD YDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAED I VHKI AAR I YEENDRAVFDKI VSKLLNLGLI TE S QAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS IDSEKEAKEKETLI T IMKTLI DFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 

CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAA.GGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T GT CAGAC T CACCCAGCT T C CCGAGAATGGT GGC TGGAATGC CC CCAT CACAGAC T TCTAC T T CC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
O CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
O TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
fi% GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
Zl GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
?Jf TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
fef GAAAT G GC T C GAGC T C AGAAGAT AAAAGAT AAG TAGGG T AT GC T GAT C C T C T T T TAAAAACC CAA 

%S GATACAAT CAAAATCCCAGAT GC TGGTC TC TAT T CCCATGAAAAAGT GC TCAT GACATAT TGAGA 

%l AGACCTACT TACAAAGTGGCATATAT TGCAAT T TATT TTAATT AAAAGATACC TAT TTAT ATAT T 

' T C T T T AT AGAAAAAAG T C T GGAAGAG T T T AC T T CAAT T G T AGC AAT GT C AG G G TGG TGGCAG TAT 

Z zx AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
W AT TCCTTGTATAAAAATAAGAAAAGAAATTAATC TTGAGGTAAGCAGAGCAGACATCATCTCTGA 

111 TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
i!| T T GTAGT AG TGATCAGGAAACAGAT C T CAGCAAAGCCAC T GAGGAGGAGGC T GT GC TGAG TT T GT 

iA GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 

GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
y AT TGTGTCCCCC T CAAAT T CACATCC T TC TTGGAATC T CAG T CT G T GAG T T TAT TT GGAGATAAG 

Hi GT CT C T GCAGATGTAGT TAG TTAAGACAAGG TCAT GC T GGATGAAGGTAGACC TAAAT T CAATAT 

GACT GGT T T CC T TGTAT GAAAAGGAGAGGACACAGAGACAGAGGAGAC GCGGGGAAGAC TAT GTA 
AAGATGAAGGCAGAGAT CGGAGT T T T GCAGCCACAAGC TAAGAAACACCAAGGAT TGTGGCAACC 
AT CAGAAGC T TGGAAGAGGCAAAGAAGAAT T CT TCCC TAGAGGCT T TAGAGGGATAACGGCTC TG 
C TGAAACC T TAAT CTCAGAC TT CCAGC CT CC TGAACGAAGAAAGAAT AAAT T TCGGCTGT T T TAA 
GCCACCAAGGATAAT T GGT TACAGCAGC TC TAGGAAAC TAATACAGC T GCTAAAATGAT CCC T GT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAG TGAACCTAGAAGCAGAGAC TC TGTGAGATAAT CGATGT TTGTTGTTTT 
AAGT TGCT CAGT T T TGGTC TAACT T GT TATGCAGCAATAGATAAAT AATAT GCAGAGAAAGAG 
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FIGURE 1*2 

MVLSGALC FRMKDSALKVL YLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS LSPVI LGVQGGS 
QCLSCGVGQEPTLTLEPWIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGC AATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACGGCACCTTCATGCTGGCTAAGGAGGGTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T T GGAGAGAGTGGAGAGAT CAAAGCAATT GGAGAAC TGGAT T T GC TGT T TAT GTC TC TGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAAT TAGAT GCCCCAAAGCGAT T TT T T TT AACCAAAAGGAAGATGGGAAGC CAAAC TCCATCATG 
AT GGGTGGAT T CCAAATGAACC C CT GCGT TAGT TACAAAGGAAAC CAAT GCCAC T T T TG TT TATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC 
TATACACAGAAAACAATT TATT TTTTAAATAAT TGTCTTTT TCCATAAAAAAGATTACT TTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T G TAT T TAT TAT TAT TAT AAGAGT GCAT T T TAT T TATATCAT T T TAT TAAT ATGGAT T TAT T TAT 
AGAAACATCAT TCGATAT T GCTACTT GAGT GTAAGGC TAATAT T GATATT TATGACAATAAT TAT 
AGAGCTATAACATGT TTATT TGACCTCAATAAACAC TTGGATATCCC 
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FIGURE 1*4 

MAALQKSVSSFmGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMIAKEASL 
ADISINTDWLIGEKLFHGVSMSERCYL^KQVLNFTLEEVLFPQSDRFQPYMQEWPFIJ^LSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide : 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
T GCC T GAGGGC TG TGCTGAGAGGGAGAGGAGCAGAGAT GC TGGT GAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGGCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC T TT C TAGATAT T TC CCCC T TGCTGGAGAAGAAAGAGCCCC TGGT T T TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT GGATGAT TC TGAGGAGGAAGCT GT TAT TGAATG TATAGAGAT TTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE is6 

MRERPRLGEDSSLISLFLQWAFIAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNS RAI S PWRYELDRDLNRLPQDLYHARCLCPHCVS LQTGS HMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 



Ml 
SO 



O 

■■, % ;; 



III 



Important features of the protein: 
Signal peptide: 

amino acids 1-32 



Q N-glycosylation site. 

O amino acids 136-140 



Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGAC T T GAGGGACC TC CGAGTAGAACCTGT TACAAC TAGT GT TGCAACAGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT TT GTGT GACGGGCAAAAGCAACT TCCAGTCC TACAGC TGTGT GAGGT GCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCGTACATCGGCTTCCCTGTA 
GAGCT GAACACAGTCTAT T T CAT T GGGGCCCATAATAT TCC TAAT GCAAATAT GAATGAAGATGG 
§4 CCCT T CCAT GT C T GT GAAT T T CACCT CACCAGGC TGCC TAGACCACATAAT GAAATATAAAAAAA 

Q AGTGT GTCAAGGCCGGAAGC CT GT GGGATCCGAACATCAC T GCT T GT AAGAAGAAT GAGGAGACA 

«* G TAGAAGT GAACT TCACAACCACT CCCC T GGGAAACAGATACAT GGCT CT TATCCAACACAGCAC 

f\ TATCAT CGGGT T T TC TCAGGTGT T T GAGCCACACCAGAAGAAACAAAC GCGAGCT T CAGT GG TGA 

m TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
=|f| AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
% 1 GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
JL CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 

TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
hi T CACACAAT T TG T TAC T TCAC T GAAT T TCT T CAAAACCAT TGCAGAAGT GAGGTCATCC T TGAAA 

||| AGTGGCAGAAAAAGAAAATAGCAGAGATGGG TCCAGT GCAG TGGC TT GCCAC T CAAAAGAAGGCA 

tl GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
* GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GT GAT C TAAGAAGCCAGAT T CAT C TGCACAAATACGTGGT GGTC TACT T TAGAGAGATTGATACA 
AAAGACGAT TACAATGCTCTCAGT GT C TGC C CCAAGTACCAC CT CATGAAGGAT GCCAC TGCT TT 

CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 1*8 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 

LRADASIRLLPCATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 

SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICY FTEFL 

QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 

FNLFCSDLRSQIHLHKYWVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACT T GGGC T GCATCAATGCTCAAGGAAAGGAAGACATC TC CAT GAAT T CCGTTCCCAT CCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGC6TCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 



MT VKTLHGPAMVKYLLLS I LGLAFLSE AAARK I PKVGHT FFQKPE SCPPVPGGSMKLD I G I INEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 

ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAAC TACC CAGCACAGCCCCC T CCGCCCC CT CT GGAGGC T GAAGAGGGAT T C 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGAT GAGGAAAAGTT T GGAGGAGCAGC TGACTCAGGGGTGGAGGAGC C TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
1 : TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
IZ TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
H GGAAC T CAACCACACACAGCAGCT GCCT GCCC T GCCC T GGC T CAAC GT G TCAGCAGATGG TGACA 

Cl ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
01 GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 

CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
7% CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
in CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
%|| GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
%J TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 

AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
L, CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
H GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
If! CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
O CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
s\l CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA. 
|K CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
H CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
111 CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVWDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLWSADGDl^HLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQ AARLRLLTLQ S WL LD APC S LP AEAALCWRAP GGD PCQ PL VP PL S WEN VT VD KVLE FPLL KGH PNLC VQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLE^LKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-58 6 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites . 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 

GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GT CAGT GC GGGAGGCCGGTCAGCCAC CAAGATGACTGACAGGTT CAGCT CT CTGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T T CTACCAC T TAGAGC TCCAGGT C AACC GCACC TACCAAAT GCACC TT GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
. : CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
U CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
O TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
ffl AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCGCGCAGGAGC 
fj t l T CCACAGC GGCATAGCC T GT CC GAGAT CACC TAC T TAGGGCAGC CAGACATC TCCATCC TCCAGC 

tZ CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
uJ GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
\| CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACC T T AGGCC TAAAGGT CAGC TT CAGAAAGAGC CACCAGC T GGAAGC T GCAT G T TAGG TGGCCT 
t, TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
y CC CTGGGGATT TGCACAGACAGAACAT C T GACCCAAAT GTGC TACACAGTGGGGAGGAAGGGACA 

111 CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
S3 CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
Ul TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
J! GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
^ AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
fll ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAA.TGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT CTAGCTCGACAGAGCAT GAGGCCCC TGC C TC T TCT GTCAT TGT T CAAAGG TGGGAAGAGA 
GC CT GGAAAAGAACC AGGC C TGGAAAAGAAC CAGAAGGAGGCTGGGCAGAACCAGAACAACC T GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T T CCCAGCCAGGGCAAC TGCCTGACGT TGCACGAT T T CAGCT T CAT TCC T C T GATAGAACAAAGC 
GAAAT GCAGGTCCACCAGGGAGGGAGACACACAAGCC T T T TCT GCAGGCAGGAGT T TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAA.TTCCAGC 
CTCAC CAC T CACAAGC TGTGT GACT T CAAACAAATGAAAT CAGT GCCCAGAACC T CGGTTTCCTC 
AT C T G TAAT GT GGGGATCATAACACCTACC TCATGGAGT TGTGGT GAAGAT GAAAT GAAGT CAT G 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 



APP ID=10063597 



Page 313 of 318 



FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNS LNVQRVLT FQPLRFI QEHVL I PVFDLS GPS S LAQPVQYS Q IRVSGPRE PAGAPQRHSLSE I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence . 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGT TTTGGATATT TTCCAAAAGAT TTGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACTT TGTC TGCT TTGAAGGAGGAAGAGATGATT 
TTAATAGTTATAATGTAGAAGAGCT TT TAGGATCTT TGGAACTGGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGC TGAAGAAG T T TC T CAGCACAGAGAGAAAT CTCC T GAGGAGT CTCGGGGGCGTGA 
AC TT GAC CC TGTGCC TGAGCCCGAGGCAT TCAGAGC T GAT T CAGAGGAT GGAGAAGGTGCT T TCT 
CAGAGAGCACCGAGGGGC TGCAGGGACAGCCC TCAGC TCAGGAGAGCCACC CTCACACCAGCGGT 
CC TGC GGCTAACGCTCAGGGAGTGCAGTC T TCGT TGGACAC TT T T GAAGAAAT TC TGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GCT TACAAAGTCCT GAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGT TAT T 
CAT T AC AG CAAAG GAT TTCGTTGG CAT C AAAAT C T AAG T T T G T T T T AC AAAG AT T G T T T T TAGT A 
C TAAGC TGCC T T GGCAGT T TGCAT T T T T GAGC C AAACAAAAAT AT AT TAT TTTCCCTTC TAAG T A 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSD^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Ami da ti on site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCT CACT TGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCGGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GC CT T GAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGTT GCAAAACC TGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C T TGGGGGT CCC TAC C T T TACCAGTGGAAT GATGACAGGT GTAACAT GAAGCACAATTATAT T TG 
CAAG TAT GAACCAGAGAT TAAT CCAACAGCCCC TG TAGAAAAGCCT TAT CT TACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAAC T AGT CCAAACCAGT CTACACT GTGGATT TCAAAGAGTA 
C CAGAAAAGAAAGTGGCAT GGAAGT ATAA TAAC TCAT T GAC T TGGT T CCAGAAT T T TG TAAT T C T 
GGAT C TGTATAAGGAAT GGCAT CAGAACAATAGC T TGGAAT GGC T T GAAAT CACAAAGGATC T GC 
AAGAT GAAC TGTAAGC TCCCCC T TGAGGCAAATAT TAAAGTAAT T T T TATAT GTC TAT TAT T TCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
AC TT CAAAC T TCAAGCAAAT GAAAT GGACAATGCAGAT AA&GT T GT TAT CAACAC G TCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
T GTATAT T GTAT T GAAATT TACAGT GT GCAAAAG TAT T TTAC CT TT GCAT AAGT G TT T GATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAAC T TAT TAC T G T T G T C AAC T G AAT T C AC AC AC AC AC AAAT AT AG TAC C AT AG AAAAAG T T T G T 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGC TAT T TCAT TAAGT GTGATATAAACC TCC TCAAACATT T TAC T TAGAGGCAAGGAT 
T G T C TAAT T T C AAT T G T GCAAG AC AT G T G C C T TAT AAT TAT T T T TAG C T T AAAAT T AAAC AG AT T 
T TGTAATAATGTAACT TTGT TAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
T GACATACACAATATAAATCATAT GT C TT CACACGT TGCC TAT ATAATGAGAAGCAGC TC TC TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT T GACAC T GGAGGCAGATAGT TGCAAAG T TAGT C TAAGGT TT CCCTAGCT GTAT T TAGCC TC TG 
AC TATAT TAGTATACAAAGAGGT CAT GTGGT TGAGACCAGGTGAATAGTCAC TAT CAG T GTGGAG 
ACAAGCACAGCACACAGACATT TTAGGAAGGAAAGGAACTACGAAAT CGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWS LLLGAALLCGHGAFCRRWSGQKVC FADFKHPC YKMAY FHELS SRVS FQEARLACE SE 
GGVLLS LENEAEQKL I E SMLQNLTKPGTG I S DGDFW I GLWRNGDGQTSGACPDL YQWSDGSNS Q 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
P YLTNQPGDTHQNVWTEAG 1 1 PNL I YWI P T I PLLLL I L VAFGT CC FQMLHKS KGRTKTS PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 



Q Transmembrane domain: 

4 J amino acids 214-235 



N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 



■ amino acids 266-269 

& 



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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